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This  dissertation  presents  several  advances  in  data  envelopment  analysis  (DEA), 
a  method  for  assessing  the  efficiency  of  decision  units  through  the  identification  of  empir¬ 
ical  best-practice  frontiers.  First,  a  new  hieretrchical  decomposition  approach  for  solving 
large-scale  problems  is  described  with  results  of  computational  testing  in  both  seriaJ  and 
parallel  environments,  that  dramatically  reduces  the  solution  time  for  realistic  DEA  ap¬ 
plications.  Second,  a  new  set  of  models  for  stratifying  and  ranking  decision  units  provides 
important  newer  insights  into  the  relationships  among  the  units  than  what  was  possible 
with  traditional  frontier  analysis.  Because  of  the  intensive  computational  requirements 
of  these  models,  their  practicality  builds  on  the  effectiveness  of  the  hierarchical  process. 
Finally,  a  new  means  of  assessing  the  robustness  of  a  decision-unit’s  efficiency  is  given 
which  spans  all  current  models  and  assists  managers  in  their  evaluation  of  process  and 
organizational  improvement  options.  It  is  expected  that  these  advances  will  permit 
practitioners  and  researchers  to  be  more  expansive  and  ambitious  in  their  use  of  this 
important  class  of  models,  and  will  hopefully  encourage  new  and  even  more  exciting 
applications  of  DEA. 
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CHAPTER  I 


MEASURING  EFFICIENCY  WITH 
DATA  ENVELOPMENT  ANALYSIS 

1.1.  Overview 

In  both  competitive  and  regulated  markets,  measuring  the  productive  efficiency 
of  firms  and  industries  is  important  to  economic  theorists,  policy  makers,  and  production 
managers.  For  e3cample,  the  measurement  of  productive  efficiency  before  and  after 
policy  changes  are  enacted  on  a  firm’s  operations  will  reveal  the  effects  of  those  changes. 
These  measures  of  efficiency  can  also  assist  managers  in  identifying  those  portions  of  the 
production  process  that,  if  made  more  efficient,  would  increase  output,  without  absorbing 
more  input  resources.  For  private  industry,  this  can  allow  the  competitive  firm  to  increase 
market  share,  effectively  translate  raw  resources  to  consumable  outputs,  and  pass  the 
savings  on  to  the  customer.  For  regulated  industries,  inefficiencies  can  be  revealed  and 
policy  effectiveness  measmed. 

1.1.1.  Firm  Productivity:  Early 
Efficiency  Measures 

One  early  approach  to  efficiency  measurement  had  been  to  provide  a  mathemati¬ 
cal  representation  of  a  firm’s  economic  activity,  based  on  a  set  of  behavioral  assumptions. 
The  actual  behavior  of  a  firm  is  compared  to  the  model  to  determine  the  effectiveness  of 
the  firm  at  achieving  productive  efficiency. 

Unfortimately,  establishing  a  mathematical  representation  for  even  a  simple  pro¬ 
duction  process  can  be  a  difficult  task.  Since  the  measured  results  of  the  model  de¬ 
pend  directly  on  the  behavioral  asscimptions,  the  validity  of  the  assumptions  must  be 
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scrutinized  carefully.  Moreover,  in  practice,  policy  makers  are  seldom  able  to  agree  on 
which  factors  to  include  in  the  model  formulation  and  as  different  models  are  employed, 
results  can  vary  greatly.  Consequently  the  results  of  the  mathematical  model  may  not 
reflect  true  economic  activity  as  much  as  the  degree  to  which  they  reflect  the  assumptions 
made  in  designing  the  model. 

Another  limitation  of  this  modeling  approach  is  that  there  are  few  common  be¬ 
havioral  assumptions  that  exist  across  a  variety  of  industries.  Therefore,  a  new  complex 
mathematical  model  must  be  developed  for  each  industrial  application.  In  addition,  once 
a  model  is  established  for  an  industry,  the  model  may  be  applicable  only  to  a  specific  time 
period.  This  lack  of  a  unifying  methodology,  based  on  a  limited  number  of  assmnptions, 
for  all  industries  and  time  frames,  proved  to  be  unsatisfactory.  Therefore,  this  approach 
to  measuring  efficiency  has  not  achieved  wide  acceptance. 

Because  of  the  above  limitations,  other  approaches  were  developed  which  did  not 
involve  building  mathematical  models,  but  centered  directly  on  measuring  the  efficiency 
of  a  firm  by  observing  the  quantity  of  inputs  consumed  and  the  quantity  of  outputs 
produced.  Those  firms  producing  more  output  per  unit  of  input,  compared  to  other  firms, 
would  naturally  be  considered  more  efficient.  Such  engineering-style  ratios  of  output  to 
input  could  also  be  used  to  rank  the  firms  by  order  of  efficiency. 

For  the  case  of  a  single  input  and  a  single  output  this  ratio  analysis  is  straightfor¬ 
ward.  But  for  the  more  common  case  of  a  firm  with  multiple  inputs  and  multiple  outputs, 
the  measurement  task  becomes  more  complex.  One  approach  has  been  to  form  a  variety 
of  simple  ratios  and  rank  the  firms  using  a  prioritization  of  the  ratios.  Determining 
the  relevant  ratios  and  assigning  the  appropriate  priority  to  each  proved  to  be  tenuous. 
Attempts  to  aggregate  the  outputs  into  a  single  weighted  meastire  and  form  a  ratio  with 
aggregated  inputs  has  also  been  offered.  However,  no  consensus  emerged  regarding  the 
appropriate  weighted  index  value  to  apply  to  each  output  and  input. 
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1.1.2.  Modem  Efficiency  Measurement 

In  1957,  M.  J.  Farrell  [51]  proposed  a  solution  to  the  problem  of  identifying  a 
single  measure  of  productive  efficiency  for  processes  which  transform  multiple  inputs  into 
multiple  outputs.  Based  on  extensions  of  ratio  anal3^is,  he  was  able  to  avoid  the  index 
number  problems  that  characterized  previous  approaches.  Because  of  the  computational 
resources  available  at  the  time,  Farrell  restricted  his  attention  to  applications  involving  a 
single  output  but  with  multiple  inputs. 

As  computer  technology  advanced,  Farrell’s  methodology  for  determining  produc¬ 
tive  efficiency  drew  increased  attention.  In  1978,  Chames,  Cooper,  and  Rhodes  [37]  used 
fractional  programming  techniques  to  demonstrate  the  equivalence  of  Farrell’s  non-linear 
measure  to  a  well-defined  linear-programming  formulation.  Whereas  Farrell’s  metric  had 
been  limited  to  the  case  of  a  single  output  because  of  the  complexity  of  solving  the 
more  generalized  approach,  the  Chames,  Cooper,  and  Rhodes  (CCR)  method  offered  an 
elegant  reformulation  and  solution  procedure  for  the  multiple-output  multiple-input  case. 
Theorists,  practitioners,  policy  makers,  and  managers  now  had  a  means  of  measuring  the 
relative  efficiency  for  all  firms  of  interest.  This  technique  is  called  data  envelopment 
analysis  (DEA).  Since  the  original  CCR  paper,  over  400  articles  have  been  published  re¬ 
garding  usage  and  extensions  of  DEA  [80].  This  growing  acceptance  among  theoreticians 
and  practitioners  refiects  an  emerging  interest  in  obtaining  a  single  efficiency  measure  to 
evaluate  a  firm’s  productivity. 

As  DEA  has  gained  in  popularity,  the  need  to  examine  the  computational  char¬ 
acteristics  of  the  DEA  formulations  has  become  evident.  Some  general-purpose  linear 
programming  codes,  when  applied  to  DEA,  have  proven  to  be  imreUable,  computationally 
expensive  (in  terms  of  time  to  solution),  and  limited  in  the  size  of  problems  that  can  be 
solved;  as  a  result,  they  limit  the  full  usefulness  of  the  advantages  that  the  DEA  approach 
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offers.  Surprisingly,  with  the  explosive  growth  of  DEA  over  the  last  decade,  little  has 
been  written  to  specifically  address  the  issues  concerning  the  computational  aspects  of 
DEA  models. 


1.1.3.  Objectives  of  the  Research 

This  dissertation  contributes  to  the  development  of  the  DEA  methodology  by 
examining  the  computational  issues  of  the  foimdational  models.  One  purpose  of  this  work 
is  to  identify  the  special  structural  characteristics  associated  with  the  DEA  formulations 
which,  when  exploited,  can  result  in  significant  computational  improvements  in  solution 
times. 

As  newer  computer  technology  is  used,  these  improvements  become  more  relevant. 
Experience  indicates  that  advances  in  mathematical  programming  have  been  closely 
linked  to  advances  in  computer  technology.  Consequently,  the  breadth  of  appUcations 
for  data  envelopment  analysis  will  grow  wider  as  the  computational  advantages  brought 
by  newer  technologies  are  identified  and  employed.  Also,  since  DEA  spans  the  fields 
of  Operations  Research  and  Economics,  improved  computational  tools  will  magnify  the 
synergism  existing  in  the  interdisciplinary  nature  of  DEA,  yielding  new  insights  and 
expanded  appUcations. 

Specifically,  this  dissertation  introduces  a  parallel  programming  approach  to  solve 
large-scale  DEA  problems.  Because  of  their  nature,  DEA  models  are  ideal  candidates 
for  exploiting  the  relatively  new  parallel  computing  machinery.  During  the  process  of 
adapting  DEA  to  this  new  technology,  new  insights  into  the  DEA  problems  were  realized. 
This  dissertation  will  identify  improvements  that  can  be  made  to  all  DEA  codes  (both 
serial  and  paraUel),  answer  problematic  concerns  that  have  been  revealed  in  DEA,  and 
present  new  approaches  for  DEA  analysis  that  are  possible  with  the  new  computationally 
efficient  codes. 
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In  addition,  a  new  DEA  model  is  presented  for  grouping  and  rank  ordering  the 
firm’s  operations  and  accounting  for  possible  outliers  in  the  data.  Also,  a  new  approach 
to  DEA  sensitivity  analysis  helps  paint  a  picture  of  the  robustness  of  the  operations  of  a 
particular  evaluation  unit.  Both  of  these  advances  build  upon  the  gains  realized  in  the 
computational  research. 


1.2.  Notation  and  Conventions 

Because  the  DEA  measure  of  efficiency  is  derived  firom  the  solution  of  a  math¬ 
ematical  programming  model,  a  description  of  the  notation  and  conventions  whidi  will 
be  used  throughout  this  dissertation  is  necessary.  As  much  as  possible,  these  notations 
will  conform  to  the  standard  emerging  in  the  DEA  literature.  Matrices  are  denote 
bold  upper  case  Latin  letters.  The  column  of  the  matrix  X  is  denoted  by  Xj  ano 
element  in  the  row  and  column  is  described  as  Xy.  Small  Greek  and  Latin  bold 
letters  represent  vectors;  the  element  of  vector  X  is  given  by  Aj.  The  Greek  letter  e  is 
an  arbitrarily  small  positive  non- Archimedean  scalar  value,  i.e.,  the  value  is  not  explicitly 
defined.  The  vector  of  all  zeroes  is  denoted  by  0.  The  vector  of  aJl  ones  is  denoted  as 
1.  Surplus  and  slack  vectors  of  the  linear  programming  problems  will  be  denoted  by  s. 
Superscripts  will  denote  the  constraints  to  which  the  surplus  or  slack  variables  are  to  be 
associated.  Scalars  are  denoted  by  small  italicized  Latin  and  Greek  letters.  Subscripts  on 
scalars  are  used  to  distinguish  two  or  more  variables  that  share  common  characteristics 
in  the  problem  but  assiune  different  values.  Any  variable  superscripted  by  “*”(e.g.,  z*), 
represents  the  value  the  variable  takes  on  as  part  of  the  optimal  solution  for  the  stated 
problem.  All  vectors  and  matrices  are  treated  in  a  manner  such  that  the  dimensions 
are  conformable  for  multiplication.  A  vector  is  treated  as  a  row  when  it  appears  on 
the  left  of  a  matrix  and  as  a  column  when  it  appears  on  the  right;  thus,  alleviating  the 
need  to  distinguish  between  row  and  column  vectors.  The  inner  product  of  two  vectors 
^  and  Yo  is  denoted  by  fiYo.  Point  sets  are  denoted  by  capital  Latin  letters.  Both 
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subscripts  and  superscripts  may  uniquely  identify  a  set  described  within  a  given  context. 
Elements  of  those  sets  are  denoted  by  small  Latin  letters  with  appropriate  superscripts 
and  subscripts.  The  cardinality  of  set  E  will  be  denoted  |E|. 

1.3.  Characteristics  of  DEA  Efficiency  Measures 

It  is  important  to  tmderstand  what  is  meant  by  efficiency  with  reference  to  DEA, 
since  economists  have  a  multitude  of  definitions  and  measures.  In  simplest  terms,  a 
measure  of  how  well  a  firm  succeeds  in  producing  as  much  of  its  outputs  as  possible  for 
a  given  amount  of  its  inputs  is  called  the  efficiency  of  operations  for  that  firm.  But  what 
is  the  basis  of  comparison?  One  approach  is  to  establish  an  absolute  standard,  usually 
based  on  a  mathematical  model,  by  which  all  firms  will  be  evaluated.  The  shortcomings 
of  this  approach  have  already  been  discussed.  An  alternate  approach  would  be  to 
measure  a  firm’s  performance  relative  to  other  firms  which  are  producing  like  outputs 
using  the  same  type  of  inputs.  Of  course,  the  quantity  of  outputs  produced  as  well  as 
the  amount  of  inputs  consumed  in  the  production  process  will  vary  firom  firm  to  firm. 
It  is  precisely  this  variation  that  will  indicate  a  firm’s  relative  efficiency  compared  to  its 
contemporaries. 

Data  envelopment  analysis  identifies:  those  firms  producing  the  most  output  for 
a  given  amount  of  input  and  those  firms  consuming  the  least  input  to  produce  a  given 
level  of  output.  Figure  1.1  depicts  the  performance  of  a  group  of  firms  (each  shown  as  a 
point  in  the  scatter  diagram)  that  produce  one  t3q>e  of  output  and  consume  one  type  of 
input.  The  efficient  firms  are  termed  technically  efficient  and  form  an  upper  boundary  by 
which  the  remaining  inefficient  firms  axe  compared.^  This  empirically  derived  boundary, 

^  Farrell  [51]  gives  a  complete  description  of  technical  efficiency  which  measures 
a  firm’s  success  in  producing  maximum  output  for  a  given  amount  of  inputs.  This  differs 
from  other  measmes  of  efficiency,  such  as  price  efficiency,  which  measures  the  firm’s 
success  of  choosing  an  optimal  set  of  inputs  to  produce  the  output.  The  efficient  firms 
identified  by  the  Farrell  measure  are  a  subset  of  the  technically  efficient  firms. 
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Figure  1.1.  Total  product  curve 

together  with  the  connecting  line  segments,  is  called  the  efficient  frontier  or  efficient 
production  surface. 

Because  DEA  identifies  technical  efficiency  in  general  terms  of  quantities  of  out¬ 
puts  and  inputs,  the  methodology  can  be  applied  to  competitive  firms,  regulated  firms, 
and  nonprofit  organizations.  This  broad  applicability  contributes  to  the  growing  use 
of  DEA.  However,  the  approach  also  has  shortcoming.  First  of  all,  the  technically 
efficient  subset  of  firms  under  analysis  may  not  be  very  effective  in  translating  inputs 
into  outputs.  Most  would  agree  that  a  firm  engaged  in  a  poor  production  process,  even 
though  performing  better  than  contemporaries,  should  still  not  be  considered  “effective.” 
Farrell  [51],  as  well  as  Chames  and  Cooper  [31],  recognized  this  shortcoming  and  state 
clearly  that  the  resulting  DEA  measure  concerns  relative  efficiency  while  the  question  of 
effectiveness,  or  absolute  efficiency,  remains  with  the  policy  makers  and  analysts. 

However,  the  attractiveness  of  DEA  is  that  it  compares  performance  with  the 
best  actually  achieved  rather  than  some  unattainable  ideal.  If  one  is  concerned  with 
the  question  of  effectiveness,  DEA  could  first  be  used  to  identify  the  technically  efficient 
firms.  The  analyst  could  then  limit  the  scope  to  these  firms  to  address  the  question  of 
effectiveness.  Consequently,  DEA  continues  to  be  of  value  even  when  the  effectiveness 
of  firms  is  drawn  into  question.  Likewi^,  if  a  parametric  mathematical  representation 
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is  desired,  the  analyst  may  limit  the  scope  only  to  the  best  performing  firms  firom 
which  to  build  the  model.  Consequently,  DEA  can  be  of  value  in  legitimizing  a  given 
parametric  approach.  For  the  case  where  prices  are  not  available  for  either  the  outputs  or 
the  inputs  (e.g.,  improvement  in  test  scores  for  elementary  school  students  participating 
in  Headstart  programs  [39]),  DEA  provides  an  ideal  approach  to  measuring  productive 
efiiciency.  However,  for  cases  where  prices,  costs,  measures  of  quality,  etc.,  axe  available, 
DEA  can  still  be  useful  in  developing  alternate  measures  of  efficiency  which  use  this 
additional  data. 

Because  the  economic  term  firm  offers  the  connotation  of  for-profit  organizations, 
a  more  general  definition  is  used  in  DEA  literature.  Since  each  organization  must  decide 
how  best  to  transform  available  inputs  into  desired  outputs,  each  organization  will  be 
referred  to  as  a  decision-making  unit  (DMU). 

1.4.  A  History  of  Frontier  Estimation  Models 
1.4.1.  The  Technical  Efficiency  Measiure 

If  the  set  N  =  {1, 2, . . . ,  n}  is  the  index  set  of  DMUs  being  compared,  we  may 
define  the  efficiency  of  each  DMUj,  j  €  N,  as  Ej  =  A  DMUj  then  is  technically 
efficient  if  Ej  =  z  =  max{Efc},  thus  having  the  highest  output-to-input  ratio.  For  the 

fe€N 

case  of  multiple  inputs  and/or  multiple  outputs,  Ej  can  be  defined  in  a  manner  similar 
to  the  one-input  one-output  case. 

The  production  process  for  DMUs  with  two  inputs  and  one  output  can  be  viewed 
as  in  Figure  1.2.  The  definition  of  efficiency  can  be  extended  to  the  case  of  two  inputs 
and  one  output  by  re-defining  Ej  =  where  Vq  represents  a  weighting  factor 

for  input  q.  The  efficient  DMUs  are  determined  as  before  and  result  in  the  greatest  ratio 
of  output  to  weighted  inputs. 

Naturally,  the  above  metrics  can  be  generalized  to  incorporate  an  arbitrary  num¬ 
ber  of  inputs  and  outputs.  If  R  =  {1, . . . ,  s}  is  an  ordered  index  set  of  s  outputs  and 
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Figure  1.2.  Decision-making  unit 


Q  =  {1, . . . ,  m}  is  an  ordered  index  set  of  q  inputs,  then  the  efliciency  for  DMUj,j  €  N, 
can  be  defined  as: 


reR 

q€Q 


where  Uf  and  Vq  are  weights  placed  on  output  r  £  R  and  input  q  eQ,  respectively. 

A  critical  concern  of  this  approach  is  determining  the  appropriate  weights.  The¬ 
orists  and  policy  makers  could  assign  specific  values  to  the  weights,  but  their  choices 
would  be  open  to  criticism  and  the  validity  of  the  resulting  efilciency  measure  could  be 
suspect. 

An  alternate  approach,  presented  by  Farrell,  avoids  this  problem.  If  the  efiicient 
DMUs  were  known,  the  weights  could  be  defined  so  that  Ej  would  equal  1  for  all  efficient 
DMUs  and  would  be  less  than  1  for  all  inefficient  DMUs.  Consequently,  the  problem  of 
finding  the  relative  technical  efficiency  measure  for  a  given  decision  making  unit,  DMUo, 
can  therefore  be  described  as: 


max  z  =  Eo 

(1.1) 

s.t.  Ej  <  1 , 

jeN 

(1.2) 

«r  >  0, 

reR 

(1.3) 

V,  >  0, 

qeQ 

(1.4) 
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To  find  all  of  the  efiicient  decision-making  units  in  N,  the  above  fractional  pro¬ 
gramming  problem  must  be  solved  n  times,  once  for  each  DMU  of  the  set.  The  computa¬ 
tional  requirements  to  solve  the  fractional  programming  problem  —  at  least  in  this  form 
—  make  solving  anything  other  than  simple  problems  impractical. 

1.4.2.  Data  Envelopment  Analysis  Approach 
1. 4.2.1.  The  CCR  Multiplier  Model 

The  work  of  Chames,  Cooper,  and  Rhodes  [37,38]  transformed  Farrell’s  nonlinear 
fractional  programming  problem  for  determining  Ej  into  a  linear  programming  problem. 
The  computational  advantages  that  the  linear  programming  formulation  offered  made  it 
possible  to  solve  for  the  efficiency  of  DMUs  with  many  inputs  and  outputs.  The  new 
formulation  to  find  Ej  has  become  known  as  the  CCR  MultipUer  Model  ^  and  is  written 


as; 

max  z  =tiYo 

(1.5) 

s.t.  /iY  —  i/X  <  0 

(1.6) 

UXo=l 

(1.7) 

H>£l 

(1.8) 

1/  >  el 

(1.9) 

In  this  model,  let  there  be  n  DMUs  with  X  =  [Xi,...,Xn]  and  Y  =  [Yi,...,!^]. 
Vectors  Xj,Yj  denote  the  observed  data  where,  Xj  =  (xy-, . . -fimj)  is  a  column  vector 
of  observed  inputs  and  Yj  =  (pij, ysj)  is  a  column  vector  of  observed  outputs  for  each 
DMU.  In  this  model,  is  the  vector  of  multipher  weights  applied  to  the  outputs  and  u 
the  multiplier  weights  applied  to  the  inputs.  To  insure  that  the  multipliers  remain  strictly 

^  This  formulation  is  termed  input-oriented;  a  similar  output-oriented  version 
will  be  described  in  a  later  discussion. 


10 


positive,  a  non-Archimedean  scalar,  e,  is  used  in  constraints  (1.8)  and  (1.9).  Eadi  DMUj 
is  assumed  to  have  the  same  inputs  and  outputs  but  in  varying  amounts.  Originally,  all 
observed  inputs  and  outputs  were  assumed  to  be  strictly  positive.  Later,  it  was  shown 
[42]  that  the  CCR  multiplier  model  holds  for  inputs  and  outputs  that  are  strictly  non¬ 
negative. 

An  attractive  feature  of  this  linear  programming  formulation,  consistent  with 
Farrell’s  development,  is  that  no  prior  knowledge  of  the  weighted  values  is  necessary 
to  identify  the  efficiency  of  a  DMU.  Here,  the  maximization  problem  finds  the  most 
favorable  weights  possible  for  DMUo  while  satisfying  all  constraints.  As  a  result,  the 
weights,  /i*  and  t/*,  are  found  to  make  DMUo  look  as  efficient  as  possible  while  insuring 
that  the  ratio  Hr=i  Mryrj/iCili  for  all  other  DMUs  never  exceeds  one.  The 
normalization  constraint  uXo  =  1  (1.7)  linearizes  Farrel’s  objective  function  and,  with 
(1.6),  insures  that  the  optimal  objective  function  value,  z*,  never  exceeds  1.  Indeed 
optimal  solutions  to  the  problem  will  find  0  <  z*  <  1  with  efficient  DMUs  indicated  by 
z*  =  1. 

Note  that  the  non-Archimedean  constant,  e,  represents  an  arbitrarily  smedl, 
strictly  positive  value  that  insures  positivity  of  the  optimal  multipfier  values.  Without 
the  non-Archimedean  term,  a  particular  DMUo  could  have  an  optimal  objective  value 
z*  =  1  with  one  or  more  of  the  multipliers  equal  to  zero,  indicating  DMUo  is  relatively 
efficient  if  one  or  more  of  the  positive  input  or  output  variables  are  ignored.  In  this  case, 
the  DMU  will  be  on  the  efficient  frontier  but  may  not  be  Pareto-Koopmans  efficient  [42] . 
For  example,  let  the  inputs  of  DMUo  consist  of  ATo  >  0  and  the  outputs  result  in  >  0. 
Let  DMUo  identical  to  DMUo  in  its  production  of  outputs  Y*  =  Yo  but  with  <  Xo 
where  the  equality  holds  for  all  a;*^,  i  =  1, . . . ,  m,  except  for  <  Xko  where  1  <  fc  <  m. 
Assmne  at  optimality,  DMUo  is  determined  to  be  efficient  with  fx*  >  0  and  t/*  >  0 
with  only  i/]J  =  0.  In  this  case,  DMUJ  would  also  be  efficient.  It  is  clear,  however,  that 
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DMUo  is  inefficient  relative  to  DMU^  because  both  DMUs  produce  the  identical  levels 
of  output  but  DMUq  consumes  less  of  one  input.  DMUo  cannot  be  Pareto-Koopmans 
efficient  if,  while  maintaining  the  same  levels  of  all  outputs,  the  production  possibility 
set  allows  for  a  decrease  of  an  input  while  holding  all  other  input  levels  constant.  A 
similar  example  can  be  constructed  when  one  of  the  multipliers  of  an  output  variable  is 
equal  to  zero  at  optimality.  A  DMUo  with  2*  =  1  but  with  at  least  one  multipher,  /x* 
or  i/*,  equal  to  zero  in  all  alternate  solutions  is  termed  “weakly  efficient”  by  Chames, 
Cooper,  and  Thrall  [42]. 

1.4. 2. 2.  CCR  Envelopment  Model 

The  linear  programming  CCR  multiplier  model  has  an  associated  dual  formula¬ 
tion  which  has  become  known  as  the  (input-oriented)  CCR  envelopment  model.  This 
model  is  written  as: 


(CCR’o)  min  «;  =  0  ~  e(ls*  -I-  ls°)  (1.10) 

s.t.  YA  -  s°  =  Yo  (1.11) 

exo-x\-8*  =  o  (1.12) 

\s\8°>0  (1.13) 

6  free  (1-14) 


where  8°  and  s*  are  the  slacks  associated  with  the  output  and  input  constraints,  respec¬ 
tively.  This  is  the  most  widely  discussed  DEA  formulation  in  current  literature.  Even 
though  the  CCR  multipher  and  envelopment  models  yield  equivalent  optimal  objective 
values,  the  envelopment  model  is  more  efficient,  computationally,  because  of  the  smaller 
number  of  constraints.^ 

^  With  the  envelopment  model,  the  number  of  constraints  equaJs  the  number  of 
outputs  and  inputs  observed  for  each  DMU.  The  number  of  variables  equals  the  number 
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The  non-Archimedean  tenn,  e,  of  the  CCR  models  poses  a  problem.  In  theory, 
this  term  is  denoted  by  an  arbitrarily  small  positive  value;  in  practice  though,  it  must  be 
explicitly  defined.  Ali  and  Seiford  [9]  show  that  the  choice  of  e  can  dramatically  effect 
the  solution  of  problems  and  produce  erroneous  efficiency  scores.  This  concern  can  be 
overcome  as  described  in  the  next  section. 

1. 4.2.3.  Ali’s  IDEAS  Formulation 

Ali  [3]  presented  a  variation  of  the  CCR  envelopment  model  which  eliminated  the 
need  to  explicitly  define  e.  This  new  approach  for  the  CCR^o  formulation,  called  the 
integrated  data  envelopment  analysis  system  (IDEAS),  can  be  written  as  the  following 
pair  of  linear  programs: 


(IDEAS  Phase  I)  min  d  (1.15) 

s.t.  Y\-s°  =  Yo  (1.16) 

eXo-XX-8*  =  0  (1.17) 

A,s^s°>0  (1.18) 

6  free  (119) 

(IDEAS  Phase  II)  max  +  s*  (1.20) 

s.t.  YX-8°  =  Yo  (1.21) 

eXo-XX-8*  =  0  (1.22) 

9  =  0*  (1.23) 

\,8\8°>0  (1.24) 

9  free  (1.25) 


of  DMUs.  In  data  envelopment  analysis,  the  munber  of  DMUs  being  observed,  n,  is 
typically  much  larger  than  the  number  of  output  and  input  variables  {s  +  m).  If  this 
were  not  the  case,  all  DMUs  would  likely  be  efficient. 
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The  9*  in  Phase  II  is  the  optimal  solution  found  in  Phase  I.  It  has  been  shown  that  a 
DMU  is  efficient  if  and  only  if  =  1,  s®*  =  0,  s**  =  0  for  the  problem  above  [3,9,31,52]. 
Otherwise,  a  weakly  efficient  DMU  would  have  6*  =  I  and  at  least  one  positive  slack 
[33],  or  a  DMU  with  ^*  <  1  would  be  inefficient  with  9*  measuring  the  extent  of  the 
inefficiency. 

The  non-Archimedean  constant  has  been  removed  from  the  problem  formulation 
but  at  a  possible  computational  cost  of  solving  an  additional  Unear  program  for  each 
DMU  during  the  two-stage  approach.'^  Later,  this  paper  will  show  how  the  computational 
costs  of  this  approach  can  be  minimized  to  achieve  the  benefit  of  eliminating  the  non- 
Archimedean  term. 


1.4.3.  DEA  Model  Variations 

The  above  efficiency  measures  all  share  a  common  shortcoming:  they  only  ac¬ 
count  for  constant  returns  to  scale.  As  a  result,  only  a  subset  of  the  DMUs  on  the 
efficient  production  surface  may  be  revealed.  This  can  best  be  seen  by  means  of  an 
iUustration.  Figure  1.3  depicts  the  case  of  a  coUection  of  DMUs,  each  with  a  single 
output  and  single  input. 

The  efficient  production  surface  is  outlined  by  DMUs  a,  b,  c,  d,  and  e.  Only 
DMU(„  and  DMUc  exhibit  constant  returns  to  scale.  In  economics  terms,  these  are  the 
DMUs  that  result  in  the  greatest  average  product  and  represent  the  most  productive 
scale  size  in  terms  of  the  level  of  output  produced.®  The  points  which  exhibit  constant 
returns  to  scale  on  the  production  surface  can  be  found  by  drawing  a  line  through  the 
origin  which  is  tangent  to  the  production  surface.  The  other  points  on  the  production 

*  Just  as  the  so-called  “Big  M”  value  is  not  explicitly  defined  in  the  “Phase  I- 
11”  approach  to  solving  linear  progranuning  problems,  IDEAS  uses  a  two-stage  process 
to  circumvent  the  need  to  explicitly  define  e. 

®  Banker  specifically  identifies  relationships  between  the  most  productive  scale 
size  and  DEA  in  his  1984  paper  [10]. 
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Figure  1.3.  Technically  efficient  DMUs 

surface,  arc  technically  efficient,  but  are  not  scale  efficient  at  the  most-productive  scale 
size.  Technical  efficiency  implies  that  for  a  given  level  of  output,  the  efficient  DMUs 
consume  the  least  amoimt  of  input. 

If  constant  scale  efficiency  implies  a  most-productive  scale  size,  how  can  DMUj, 
and  DMUc  both  exhibit  constant  returns  when  they  produce  different  levels  of  outputs? 
Even  though  DMUj,,  and  DMUc  produce  different  levels  of  output,  the  amount  of  the 
difference  is  exactly  proportional  to  the  difference  in  the  amount  of  inputs.  Consequently, 
the  two  DMUs  are  called  scale  equivalent.  DMUj  and  DMUe  represent  decreasing  returns 
to  scale  compared  to  the  most  productive  sceile-efficient  DMUs  because  they  consume 
more  input,  but  produce  less  than  a  proportional  increase  in  output.  Moving  from 
DMUa  to  the  most  productive  scale  efficient  DMUs  represents  increasing  returns  to  sea' 
because  the  percentage  increase  in  the  amount  of  output  produced  exceeds  the  percentage 
increase  in  the  amount  of  inputs  consumed. 

All  of  the  above  discussion  extends  to  higher  dimensions.  By  limiting  the  scope  to 
constant  returns  to  scale,  only  a  portion  of  the  efficient  production  surface  is  identified. 
Consequently,  extensions  to  the  CCR  data  envelopment  analysis  measure  were  developed 
to  identify  all  technically  efficient  DMUs. 
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Farrell  and  Fieldhouse  [52]  noticed  this  limitation  to  the  Farrell  measure  in  1962. 
They  presented  a  “grouping  method”  to  account  for  increasing  and  decreasing  returns  to 
scale.  In  this  methodology,  DMUs  with  similar  input  and  output  amounts  were  grouped 
together  and  efficiency  scores  determined  with  respect  to  the  other  DMUs  within  the 
particular  group.  This  method  never  gained  wide  acceptance,  and  in  the  case  of  multiple 
outputs  and  multiple  inputs,  the  process  became  too  complex  and  imreliable. 

1.4.3. 1.  BCC  Variable-Returns-to-Scale  Model 

In  1984,  Banker,  Chames,  and  Cooper  [11]  successfully  modified  the  original  CCR 
envelopment  formulation  to  allow  for  variable  returns  to  scale.  The  BCC  input-oriented 
envelopment  model  can  be  written  as: 


(5CC7* )  min  9  -  e(ls*  -I- 15°)  (1.26) 

s.t.  Y\-8°  =  Yo  (1.27) 

0Xo-XA-s*  =  O  (1.28) 

lA  =  1  (1.29) 

A,s^s°>0  (1.30) 

6  free  (1-31) 


In  their  paper.  Banker,  Chames,  and  Cooper,  show  how  this  new  formulation 
reveals  the  entire  efficient  production  surface  and  all  technically  efficient  DMUs.®  In  a 
sense,  the  additional  constraint  automatically  groups  DMUs  with  similar  output  levels 
and  determines  the  appropriate  efficiency  score.  For  inefficient  DMUy  in  Figure  1.3,  the 
efficiency  score  will  be  less  than  1.  The  overall  efficient  score  found  by  the  CCR  model 

®  One  assumption  of  this  formulation  is  that  regions  of  increasing  returns  to 
scale  are  followed  by  regions  of  constant  or  decreasing  returns  to  scale.  This  is  beneficial 
because  it  satisfies  the  normal  economic  assumption  of  convexity  of  the  production 
function. 
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will  be  calculated  as  the  ratio  of  the  lengths  of  the  Une  segments  In  a  Uke  manner 

the  technical  efliciency  score,  found  by  the  BCC  model,  will  be  The  scale  efficient 

measure,  can  be  found  by  dividing  the  CCR  result  by  the  BCC  result.  Obviously, 
if  a  DMU  is  both  scale  and  technically  efficient,  the  efficiency  measure  will  be  1  in  both 
cases.  Notice  that  for  some  DMUs,  such  as  DMUg,  the  scale  and  technical  inefficiency 
scores  will  be  identical.  In  other  cases,  such  as  DMUj,  the  DMU  will  be  technically 
efficient  but  not  scale  efficient.  Therefore,  to  determine  the  technical,  scale,  and  overall 
efficiency,  two  linear  programs,  the  CCR  and  the  BCC  models,  must  be  solved  for  each 
DMU. 


1.4.4.  Compendium  of  Data  Envelopment  Models 
The  formulations  of  efficiency  measure  discussed  so  far  are  all  input-oriented, 
that  is,  the  efficient  DMUs  are  discovered  by  observing  an  output  level  and  identifying 
the  DMU  that  consumes  the  least  input.  Inefficient  DMUs  are  projected  onto  the 
efficient  surface  by  proportionally  reducing  all  inputs  while  holding  output  constant.  The 
projection  distance  jdelds  the  efficiency  score. 

However,  a  different  perspective  can  be  accomplished  by  viewing  efficiency  from 
an  output-oriented  perspective.  In  this  case,  the  efficient  DMUs  axe  discovered  by 
observing  a  given  level  of  input  and  identifying  the  DMU  that  produces  the  most  output. 
The  inefficient  DMUs  are  projected  onto  the  efficient  surface  by  proportionally  increasing 
all  outputs  while  holding  inputs  constant.  The  projection  distance  is  the  output-oriented 
inefficiency  score.  It  is  important  to  note  that  the  input-  and  output-oriented  schemes  of 
the  BCC  model  identify  the  same  efficient  surface,  which  is  composed  of  all  technically 
efficient  DMUs.  However,  the  scores  for  the  inefficient  DMUs  may  vary  between  the  two 
formulations.  For  the  CCR  model,  the  input-  and  output-oriented  schemes  will  identify 
the  same  most-productive-scale-size  efficient  surface  and  the  scores  for  the  inefficient 
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DMUs  of  the  input-oriented  model  will  be  the  reciprocals  of  the  scores  for  the  output- 
oriented  model.  Seiford  and  Thrall  [81]  present  a  generaUzation  of  the  BCC  and  CCR 
models  with  input-  and  output-oriented  schemes  and  define  models  with  non-decreasing 
returns  to  scale  (NDRS)  and  non-increasing  returns  to  scale  (NIRS).  Table  1.1  and  Table 
1.2  describe  the  envelopment  and  multipfier  model  formulations. 

The  analysis  that  follows  in  this  paper  will  be  based  on  the  input-  and  output- 
oriented  models  presented  here.  To  identify  all  types  of  efficiency  for  a  given  DMU, 
eight  linear  programs  must  be  solved.  This  involves  identifying  the  scale  and  technical 
efficiency  measures  for  both  the  input-  and  output-oriented  models  using  the  two-phase 
approach  of  the  IDEAS  model.  Other  variations  of  these  models  have  been  developed 
which  further  increase  the  computational  complexity  of  determining  the  DEA  efficiency 
measures  for  the  DMUs  of  interest. 

Although  applications  containing  tens  of  thousands  of  DMUs  are  now  beginning 
to  arise  in  practice,  the  computational  issues  associated  with  such  large-scale  models 
have  not  been  addressed  in  the  literature.  This  research  explores  parallel  processing  as  a 
means  of  approaching  these  challenging  problems.  The  next  section  presents  an  overview 
of  the  salient  concepts  associated  with  parallel  programming  as  a  background  for  this 
discussion. 

1.5.  Parallel  Computing  Fundamentals 

A  relatively  new  and  expanding  field  that  offers  great  promise  for  solving  difficult 
large-scale  problems  is  application-level  parallel  processing.  Computational  gains  can  be 
achieved  in  a  multitasking  setting  where  the  power  of  multiple  processing  elements  act 
concurrently  to  find  the  solution  for  a  single  problem.  It  is  important  to  distinguish  the 
multitasking  of  a  single  application  from  multiprogramming  wherein  an  operating  system 
allows  a  computer  to  execute  multiple  umrelated  programs. 
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Table  1.1.—  Envelopment  models 


BCC) 

NDRS) 

NIRS) 


CCRf^ 

BCCTj 

NDRS^j 

NIRS^ 


ADD^ 


(Input-oriented) 

min  {Oj  -  e(la‘  +  1«°)|YA  -  =  Yj,ejXj  -  XA  -  a*  =  0} 

min  {Oj  -  e(la‘  +  la®)|YA  -  a®  =  Y ,  9jXj  -  XA  -  a*  =  0,  lA  =  1 } 

min  {9i  -  e(la‘  +  la°)lYA  -  a°  =  Y ,  9iXj  -  XA  -  a‘  =  0, 1 A  >  1 } 

A,a*,«°>0,aj 

min  {9i  -  e(la*  -I-  la°)|YA  -  a®  =  Yj,9jXj  -  XA  -  a*  =  0,  lA  <  1} 


( Output-oriented) 

min  {<i)j  +  e(la*  -I-  la°)|XA  +  a‘  =  Xj,  <i>jYj  -  YA  -I-  a°  =  0} 

min  Uj  +  e(ls*  +  la®)|XA  a*  =  Xj,  <i>jYj  -  YA  -I-  a°  =  0,  lA  =  0} 

min  {4>j  +  £(la‘  -I-  la«)|XA  -I-  a*  =  Xj,  <i>jYj  -  YA  -I-  a°  =  0,  lA  >  0} 

X,a*,a°>Q,4>j 

min  {4>j  +  e(la*  -I-  la‘‘)|XA  -I-  a*  =  Xj ,  4>jYj  -  YA  -t-  a®  =  0, 1 A  <  0} 


(Non-oriented) 

min  {-la*  -  la°|YA  -  a°  =  Y,  -XA  -  a*  =  -X,,  lA  =  1} 

A,«*,«°>0 


Table  1.2.—  Multiplier  models 


CCR) 

BCC) 

(Input-oriented) 

m^{2  =  /iijI/iY  —  i/X  <  0; uXj  =  l,li>el,u>  el} 
max  {z  —  uYj  +  u*|iiY  —  i/X  u*l  <  0; i/Xj  =  1;  u  >  el, i/  >  el, u»  free} 

max  {z  =  fxYj  -f-  u*|#iY  —  */X  -1-  u*l  <  0;  i/Xj  =  l,fji>el,t/  >  el, u»  >  0} 

max  {z  =  fiYj  +  u*|/iY  —  i/X  +  u*l  <  0;  uXj  =  el,u  >  el,  u*  <  0} 

NDRSj 

NIRS) 

CCR) 

W^j 

(Output-oriented) 

min{2  =  i/Xj|/iY  -  i/X  <  0,|iY  =  1,/i  >  el,*/  >  el} 

M.*' 

min  {z  =  uXj  +  t;*luY  -  i/X  —  v*l  <  0,  uYj  =  1,  ii  >  el,  */  >  el,  w*  free} 
min  {z  =  uXj  +  u*(/iY  -  i/X  —  u*!  <  0,uY  =  1,/x  >  el,*/  >  el,v*  <  0} 
min  {2  =  uXj  +  vJ^Y  —  uX  —  u*l  <  0,  liY  =  1, #*  >  el,  */  >  el, u,  >0} 

|j2^| 

NIRS^j 

(Non-oriented) 

ADDj  :  —  uXj  +  u;|/iY  —  i/X  -1-  Iw  >  0,  — >  —1,  —v  >  — 1,  w  free} 
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The  basic  premise  in  multitasking  in  a  parallel  environment  is  that,  for  any 
given  problem,  a  large  number  of  tasks  must  be  completed  to  successfully  arrive  at  a 
correct  solution.  If  these  tasks  can  be  scheduled  for  simultaneous  execution  on  multiple 
processors,  opportunities  exist  for  achieving  dramatic  reductions  in  solution  times.  This 
fundamental  motivation  for  parallel  programming  suggests  great  performance  gains  can 
be  made  using  new  hardware  technology  if  the  work  can  be  decomposed  effectively. 

Substantial  performance  gains  can  be  difficult  to  achieve.  While  modest  gains 
are  possible  for  many  applications  in  a  parallel  setting,  outstanding  results  occur  only 
for  a  much  smsJler  set  of  problems.  To  achieve  outstanding  results,  the  majority  of 
steps  of  the  algorithm  must  be  done  in  parallel,  and  the  algorithm  must  closely  match 
the  architect  : re  of  the  underlying  machine.  As  a  result,  advances  in  algorithm  perfor¬ 
mance  are  tied  to  how  well  they  can  exploit  the  computing  machinery.  As  advances 
in  technology  offer  new  challenges  in  developing  improved  algorithms  to  match  new 
hardware,  insights  into  computational  aspects  of  the  problem  can  be  gained  as  the  new 
algorithms  are  developed.  These  problem  insights  can  be  useful  both  to  the  parallel  and 
serial  environments.  Therefore,  even  though  the  challenge  to  achieve  outstanding  gains 
through  parallel  programming  may  seem  formidable,  the  challenge  also  has  the  potential 
to  provide  new  insights  into  the  solution  process  that  may  otherwise  go  imnoticed. 

1.5.1.  Measuring  Parallel  Performance 

Parallel  processing  consists  of  coordinating  the  power  of  multiple  processors  to 
accomplish  a  given  task  concerning  a  given  set  of  data  [20].  To  measure  the  impact  of 
parallelization,  a  metric  is  needed  to  serve  as  a  basis  of  comparison  between  serial  and 
parallel  performance. 

Of  particular  interest  is  a  measure  of  the  ability  of  parallel  processing  to  improve 
the  real  time  to  solve  a  particular  problem.  Such  improvement  permits  existing  prob¬ 
lems  to  be  solved  faster,  and  may  render  solvable  problems  whose  dimensions  exceed 
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traditional  computational  capacities.  Regardless  of  the  motivation,  a  measure  is  needed 
to  reflect  the  computational  improvement  offered  by  the  parallel  algorithm  over  a  serial 
algorithm  to  accomplish  the  same  task. 

Speedup,  of  which  there  are  several  definitions  [20],  is  the  most  common  measure 
of  the  effects  of  parsdlelization  on  a  problem.  In  this  research  report  relative  speedup 
is  used.  Relative  speedup,  S{p)  is  the  ratio  of  a  problem’s  solution  time  using  the  serial 
version  of  the  algorithm  to  the  time  using  the  parallel  version  of  the  same  algorithm  with 
p  processors  on  the  same  machine  [62].  In  the  development  of  any  parallel  algorithm,  a 
design  objective  is  to  use  all  resources  efliciently  to  achieve  linear  speedup  where  S[p)  —  p. 
The  possibility  of  achieving  superlinear  speedup,  S{p)  >  p,  is  open  for  debate  [20]. 

Many  factors  determine  the  speedup  potential  for  any  parallel  algorithm.  In  1967, 
G.M.  Amdahl  formulated  a  law  describing  the  maximum  possible  speedup  on  a  parallel 
computer  with  p  processors.  Amdahl’s  law  can  be  written  as: 


S{p)< 


1 

a  +  (1  -  a)/p 


where  5(p)  represents  speedup  using  p  processors,  and  a  is  the  fraction  of  code  that  must 
be  run  in  serial  (i.e.,  is  not  parallelizable).  FVom  the  law  we  can  see 


lim  S(p)  <  — . 

p-*oo  a 


Therefore,  according  to  Amdahl’s  law,  linear  speedup  may  only  be  possible  when  a  small 
portion  of  the  programming  steps  is  restricted  to  the  seri€d  case. 

Figure  1.4  depicts  the  maximum-attainable  speedup,  according  to  Amdahl’s  law, 
for  various  ranges  of  p  and  a.  This  illustrates  the  inherent  difficulty  of  achieving  linear 


21 


Figure  1.4.  Maximum-attainable  speedup  as  a  function  of  a  and  p 


speedup.  It  becomes  obvious,  that  the  achievement  of  Unear  speedup  as  the  number  of 
processors  increases  demands  almost  complete  parallehzation  of  the  algorithm’s  steps. 

Efficiency,  defined  as  E{p)  =  is  another  measure  commonly  used  to  deter¬ 
mine  the  performance  of  the  parallel  algorithm,  reflecting  speedup  normalized  by  the 
number  of  processors.  An  efliciency  of  1  reflects  Unear  speedup;  a  value  less  than  1  is  the 
fraction  of  Unear  speedup  attained. 

Table  1.3  reveals  the  difierent  values  of  o  that  are  necessary  to  achieve  a  given 
level  of  efficiency  for  various  number  of  processors.  This  table  imderscores  the  difficulty  of 
achieving  Unear  speedup,  which  rises  proportionately  with  the  number  of  processors  used. 
With  more  processors,  the  fraction  of  tasks  that  require  serial  implementation  must  be 
very  small.  However,  as  Amdahl  noticed,  the  fraction,  a  often  depends  on  the  problem 
size.  If  n  measures  problem  size,  then  a(n)  — »  0  as  n  — »  oo  since,  for  many  problems,  the 
amoimt  of  work  to  be  done  in  serial  is  at  a  fixed  level  regardless  of  problem  size.  This 
fixed  amount  of  work  becomes  negUgible  for  large  problem  sizes.  As  a  resxilt,  for  a  fixed 
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Table  1.3.  ~  Necessary  Amdahl  fraction  a  levels 


Number  of 
processors 

EflBciency 

.8 

.9 

.95 

.99 

5 

msuM 

Ha 

10 

15 

KuS 

.0038 

.0007 

20 

isa 

.0028 

.0005 

p,  S(p)  —*  p  as  n  —*  oo.  Consequently,  linear  speedup  may  be  possible  for  large,  highly 
decomposable  problems. 


1.5.2.  Parallel  Computer  Architectures 

When  approaching  a  problem  for  parallel  implementation,  the  designer  must  in- 
siure  the  algorithm  takes  into  account  the  advantages  of  the  specific  hardware  on  which 
the  algorithm  will  be  executed.  Unlike  single-processor  machines,  where  serial  algorithms 
will  execute  in  a  more  or  less  common  fashion  across  many  different  types  of  computers, 
the  parallel  implementation  must  be  tailored  to  the  specific  hardware.  Parallel  architec¬ 
tures  take  on  many  different  configurations,  each  offering  its  own  unique  set  of  advantages 
and  disadvantages.  Consequently,  an  algorithm  designed  for  a  particular  parallel  machine 
may  be  completely  inappropriate  for  another  type  of  parallel  computer.  In  1972  Fl3mn 
[56]  offered  a  classification  scheme  for  all  types  of  parallel  computers.  The  four  classifi¬ 
cations  are  dependent  on  the  type  of  instruction  stream  and  data  stream  the  computer 
is  capable  of  controlling.  The  instruction  stream  is  the  sequence  of  programming  steps 
carried  out  on  the  computer  to  arrive  at  a  solution  to  the  problem.  The  objects  on  which 
the  instructions  act  are  called  the  data  stream.  The  single-instruction,  single-data  (SISD) 
computers  are  the  traditional  uniprocessor  computers  which  range  from  a  personal  com¬ 
puter  to  the  Cray-1  supercomputer.  The  single-instruction,  multiple-data  computers 
(SIMD)  are  represented  by  multi-processor  computers  where  each  processor  executes 
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the  same  instruction  sequences  on  diflFerent  sets  of  data.  The  best-known  machine  in 
this  category  is  the  Connection  Machine  (models  CM-1  and  CM-2)  that  contains  up  to 
64,000  processors.  Currently,  there  are  no  known  general-purpose  multiple-instruction, 
single-data  (MISD)  computers.  The  most  common  parallel  architecture  is  the  multiple- 
instruction,  multiple-data  (MIMD)  computer  which  can  simultaneously  operate  multiple 
independently  executing  processors  on  different  data  sets. 

Both  the  SIMD  and  MIMD  architectures  require  conununication  between  the 
processors  to  insure  integrity  of  the  solution  procedure.  The  conununication  can  occur 
either  through  shared-memory  access  via  a  central  switch  (bus)  or  by  message-passing 
through  an  interconnected  network  in  a  distributed  system.  The  shared-memory,  or 
tightly  coupled,  design  offers  the  advantage  of  minimal  communication  overhead  and 
simplified  programming,  but  the  bus  bandwidth  limits  the  number  of  processors.  With 
tightly  coupled  shared  memory,  each  processor  takes  the  same  amount  of  time  to  access 
a  particular  memory  location.  Although  distributed  processing  systems  pose  few  limits 
on  the  number  of  processors,  they  have  higher  communication  overhead  and  require  more 
elaborate  programming  techniques. 

All  computational  testing  for  this  research  was  conducted  on  a  tightly  coupled 
MIMD  Sequent  Symmetry  S81  (Rev  B)  at  Southern  Methodist  University.  It  is  config¬ 
ured  with  20  Intel  80386  processors  and  Weitek  numeric  coprocessors  and  32  megabytes 
of  sharable  memory.  A  Unix  operating  sj^tem  is  used  to  schedule  the  parallel  tasks. 

1.5.3.  Programming  MIMD  Multiprocessors 

Parallel  processing  consists  of  concurrent  manipulation  of  data  among  several 
processors.  The  goal  of  parallel  algorithm  design  is  to  minimize  a  while  maintaining 
a  balance  of  computational  load  across  all  processors.  The  overall  goal  of  the  use  of 
parallelism  is  to  increase  the  speed  in  solving  the  problem. 
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Several  approaches  exist  in  parallel  programming  to  distribute  the  work  load 
among  the  processors.  The  two  primary  methods  of  performing  parallel  operations  are 
functional  partitioning  and  data  partitioning  (75).  The  two  may  be  used  separately  or 
in  combination  with  any  algorithm  on  a  MIMD  machine.  With  functional  partitioning, 
the  various  processors  perform  different  functions  on  the  same  data  set  to  complete  the 
problem.  An  analogy  of  this  process  can  be  made  with  the  construction  of  a  house. 
Several  tasks  are  needed  to  complete  the  structure.  The  foundation  must  be  poured, 
framing  done,  plumbing  and  electrical  work  completed,  dry  wall  himg,  bricks  laid,  and 
the  finished  work  painted.  While  some  tasks  depend  on  the  completion  of  others,  many 
can  be  done  concurrently.  In  the  this  case,  the  electricians  play  the  role  of  one  processor, 
the  plumbers  another,  etc.  each  operating  differently  on  different  aspects  of  the  data  set, 
the  house. 

Data  partitioning  or  domain  decomposition  is  fundamentally  different  from  func¬ 
tional  partitioning.  In  data  partitioning,  each  processor  is  assigned  a  similar  task  to 
be  conducted  on  uifferent  parts  of  the  data.  Again,  analogies  can  be  drawn  with  the 
construction  of  the  house.  As  with  multiple  processors,  multiple  bricklayers  can  work 
together  to  either  complete  a  single  wsdl  or  multiple  walls  faster  than  a  single  worker. 
Again,  for  successful  completion,  communication  must  occur  between  the  different  pro¬ 
cessors  to  insure  a  correct  outcome. 

Determining  whether  to  use  functional  partitioning  or  data  partitioning  is  critical 
to  the  successful  implementation  of  the  parallel  algorithm.  An  incorrect  choice  could 
limit  useful  speedup  because  of  the  added  time  for  synchronization  and  communication 
between  processors.  Additionally,  load  imbalances  can  occ'ir  where  some  processors  wait 
for  others  to  complete  their  assigned  work.  In  the  DEA  study  reported  here,  data 
partitioning  was  used  because  of  special  characteristics  of  the  DEA  linear-programming 
problems;  exceptional  speedup  was  accomplished  across  a  large  number  of  processors. 
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Once  a  functional-partitioning  or  data-partitioning  approach  is  chosen,  a  decision 
must  be  made  on  how  to  schedule  the  tasks  on  the  available  processors.  Two  methods 
of  scheduling  the  jobs  are  pre-schedtding  and  self-scheduling  [75].  With  pre-scheduhng, 
the  distribution  of  work  across  the  processors  is  set  prior  to  compilation  and  requires 
little  overhead.  This  type  of  scheduUng  is  ideal  when  the  tasks  require  the  same  amount 
of  time.  If  the  tasks  require  differing  amounts  of  time,  the  workload  could  be  severely 
imbalanced  across  the  processors.  A  second  method  to  schedule  the  tasks  is  known  as 
self-scheduling  or  dynamic  scheduling.  This  method  allows  each  processor  to  schedule  its 
own  tasks  at  run  time  by  selecting  work  from  a  task  queue;  once  the  task  is  completed, 
the  processor  returns  to  the  queue  to  acquire  additional  work.  The  overhead  of  coordi¬ 
nating  the  tasks  and  maintaining  the  work  queue  is  more  time  consuming  than  with  the 
pre-scheduled  approach,  but  self-scheduling  maintains  a  more  balanced  workload  across 
all  processors.  For  the  DEA  code  described  herein,  the  self-scheduUng  approach  was  used 
since  tasks  required  varying  amounts  of  time,  and  workload  balance  was  a  high  priority 
in  the  algorithm  design. 


1.6.  Objectives 

This  dissertation  presents  several  advances  in  data  envelopment  analysis.  First, 
a  new  computational  approach  for  solving  large  scale  problems  is  described,  with  testing 
presented  for  both  serial  and  parallel  environments.  Second,  a  new  DEA  modeling  and 
evaluation  scheme  is  presented  that  gives  deeper  insights  into  the  relationships  among 
the  decision-making  units  than  traditional  analysis.  Finally,  a  new  means  of  assessing 
the  robustness  of  a  decision-unit’s  efficiency  is  given  which  spans  ail  current  models.  All 
of  the  new  models  described  can  build  on  the  computational  advances  to  make  practical 
their  implementation  in  industrial  and  governmental  apphcations. 

Chapter  II  describes  a  state-of-the-art  parallel  approach  to  solving  DEA  prob¬ 
lems.  By  using  linear  programming  theory,  a  computationally  efficient  approach  is 
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introduced  and  the  solution  to  previously  unsolvable  problems  will  be  anaJyzed.  The 
parallel  approach  motivates  a  new  hierarchical  decomposition  methodology  for  solving 
DEA  by  first  isolating  the  efficient  DMUs  and  then  determining  the  DEA  scores  for  the 
inefficient  DMUs.  The  new  approach  requires  fewer  computing  resources  and  improves 
solution  time  by  one  to  two  orders  of  magnitude  for  large-scale  DEA  applications.  The 
hierarchical  approach  also  reduces  the  need  to  duplicate  effort  when  solving  variations 
of  the  DEA  models  on  the  same  set  of  data.  This  opens  the  door  to  a  new  range  of 
applications  for  the  family  of  DEA  models. 

These  computational  improvements  make  more  practical  a  new  approach  to  rank¬ 
ing  DMUs  as  discussed  in  Chapter  III.  This  approach  may  be  useful  to  managers  by 
providing  a  different  “picture”  of  the  efficient  surface  and  sublayers.  This  will  eillow 
managers  to  observe  more  options  for  improving  production  techniques  by  striving  not 
only  for  improved  efficiency  scores,  but  by  improving  its  rank  order  among  competitors. 
The  approach  will  also  demonstrate  how  outliers,  which  may  distort  the  DEA  efficiency 
measure,  can  be  accoimted  for  emd  corrected. 

In  Chapter  IV  an  improved  sensitivity  analysis  approach  is  presented  which  helps 
develop  a  picture  for  the  robustness  of  the  operations  of  a  particular  DMU.  The  moti¬ 
vation  for  the  chapter  resulted  from  the  need  to  know  how  much  the  use  of  inputs  and 
production  of  outputs  can  change  before  a  DMU  changes  its  efficiency  status. 

Chapter  V  closes  with  observations  and  conclusions  drawn  from  this  research 
effort.  Fhture  research  directions  are  discussed  along  with  a  summary  of  the  advances 
made  in  this  report. 


CHAPTER  II 


NEW  ADVANCES  IN  COMPUTATIONAL  DEA 

Data  envelopment  analysis  (DEA)  has  become  an  established  nonparaunetric 
methodology  for  assessing  the  relative  efficiency  of  comparable  production  and  other 
decision-making  units.  As  such,  it  has  evolved  into  an  approach  to  evaluating  certain 
types  of  data  sets:  namely,  those  that  can  be  viewed  as  emanating  from  a  (economic) 
production  function.  The  range  of  applicability  turns  out  to  be  a  wide  one  and,  while 
the  approach  is  new — relative  to  other  multivariate  analysis  techniques — its  usage  and 
variety  of  applications  has  grown  rapidly  since  its  introduction  in  1978. 

Surprisingly,  little  has  been  published  on  the  computational  aspects  of  DEA 
[3,4,6,7,9,46,76].  Since  DEA  tjTsically  involves  the  solution  of  a  large  number  of  linear 
programs,  many  practitioners  and  researchers  assume  that  the  repeated  use  of  standard 
linear  programming  codes  is  sufficient  for  an  analysis.  Unfortunately  this  is  not  the 
case.  Specialized  codes  are  needed  to  correctly  handle  the  preemptive  prioritized  multiple 
objectives  (reflecting  the  models’  non-Archimedian  infinitesimal)  and  to  coordinate  and 
expedite  the  solution  of  the  large  number  of  related  linear  programs. 

The  state  of  the  art  for  DEA  codes  has  also  been  limited  in  the  size  of  data  sets 
that  can  be  evaluated  in  a  reasonable  time:  typically  in  the  hundreds  of  data  points. 
(Codes  that  handle  1,000  observations  have  been  reported,  but  run  times  can  be  on  the 
order  of  days  [5].)  This  st  jdy  was  motivated  by  large  applications  we  have  encountered: 
franchise  analysis  (e.g.,  over  8,000  McDonald’s  restaurants,  6,500  Century  21  real-estate 
offices,  and  approximately  5,000  H&R  Block  tax-preparation  service  centers),  the  Federal 
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Reserve  Bank’s  efficiency  study  of  8,742  U.S.  banks,  a  V.A.  study  of  over  20,000  hospi¬ 
tals,  and  U.S.  Postal  Service  evaluations  of  over  30,000  branches.  These  problem  sizes 
are  clearly  beyond  the  limits  of  current  DEA  codes. 

This  chapter  describes  a  new  code  for  solving  large-scale  DEA  problems  in  a 
reasonable  amoimt  of  time,  demonstrate  its  performance  on  a  real-world  application,  and 
report  on  its  ability  to  exploit  parallel  processing  to  further  accelerate  solution  time.  We 
also  introduce  a  new  decomposition  algorithm  that  streamlines  the  solution  of  problem 
sets  and  provide  in-depth  computational  testing  of  the  code  on  small-  and  large-scale 
problems,  including  the  largest  DEA  problems  reported  to  date. 

2.1.  Computational  DEA 

Key  to  the  development  of  software  for  putting  DEA  into  practice  are  the  math¬ 
ematical  underpinnings  and  associated  means  of  exploiting  domain-specific  structure  for 
computational  gain.  We  now  briefly  summarize  relevant  DEA  concepts,  describe  efficient 
implementation  techniques,  and  present  the  results  of  computational  testing  of  a  new 
DEA  research  code. 


2.1.1.  DEA  Fundamentals 

As  described  previously,  data  envelopment  analysis  is  a  family  of  models  for 
assessing  and  analyzing  the  relative  transformational  efficiency  of  similar  decision-making 
units.  A  decision-making  unit  (DMU)  is  any  organization,  process,  or  other  entity  that 
uses  a  set  of  resources  (inputs)  to  produce  a  set  of  products  or  services  (outputs).  All 
DEA  models  have  the  same  data  requirements:  for  each  DMU  j,  a  nonegative  observed 
value  for  each  of  the  s  outputs  (Yj)  and  m  inputs  (Xj),  with  at  least  one  positive  input 
and  output. 

A  DMU’s  efficiency  is  defined  in  the  tradition  of  engineering  and  financial  anal¬ 
ysis:  the  ratio  of  the  weighted  total  output  produced  to  the  total  input  consumed,  or 
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fj,Yj/ifXj.  For  each  OMU  in  the  analysis  there  is  associated  a  separate  linear  program 
which  determines  such  an  efficiency  ratio  6,  where  0  <  0  <  1  is  typically  implied. 

The  various  DEA  models  separate  the  set  of  n  DMUs  (D)  into  efficient  (E*)  and 
inefficient  (I*)  sets.  A  DMU  is  efficient  if  ^  =  1,  =  0,  and  s*  =  0.  Mathematically,  the 

members  of  E*  are  extreme  points,  extreme  rays,  or  lie  on  a  convex  surface  which,  when 
taken  with  their  associated  facets,  form  a  piecewise-linear  empirical  production  surface, 
or  efficient  frontier.  Managerially,  these  units  exhibit  best  practice  of  transformational 
efficiency,  relative  to  their  inefficient  peers. 

Otherwise,  a  DMU  is  deemed  inefficient  (i.e.,  when  6  <  1,8*  ^  0,  or  8°  0), 
and  the  observation  lies  within,  not  on,  the  efficient  frontier.  In  this  case,  the  unit 
uses  more  input  for  the  same  level  of  output  (or  produces  lower  output  for  the  same 
level  of  input)  as  another  unit,  or  some  combination  of  units.  For  each  DMU  d  6  I*, 
the  corresponding  hnear  program’s  solution  suggests  a  set  of  inputs  and  outputs  that 
would  make  d  efficient.  This  virtual  DMU  is  obtained  by  a  projection  (not  necessarily 
orthogonal)  onto  the  efficient  frontier,  and  is  expressed  as  a  linear  combination  of  d’s 
reference  set  The  reference  set  contains  those  efficient  DMUs  with  which  d  is  being 
directly  compared.  The  virtual  DMU’s  inputs  and  outputs  are  determined  as  Yj  =YX* 
and  Xj  =  XA*. 

Although  there  are  many  formulations  of  DEIA  models  (see  Table  1.1  and  Table 
1.2),  the  useful  results  in  each  case  are  similar  in  nature  and  are  to  be  determined  in 
practice  by  a  DEA  code.  A  complete  data  envelopment  analysis  produces,  for  each 
DMU:  (1)  its  efficiency  score,  6]  (2)  a  set  of  weights,  fM  and  u,  for  the  outputs  and 
inputs,  respectively;  and,  for  inefficient  units,  (3)  values  for  the  slack  variables  and 
8*,  (4)  a  set  of  reference  DMUs,  and  (5)  values  for  A,  from  which  a  virtual  DMU  can  be 
computed.  In  the  next  section,  we  describe  some  of  the  techniques  that  have  been  used 
to  expedite  the  calculation  of  these  items. 
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2.1.2.  Survey  of  Efficiency  Techniques 

In  this  discussion,  the  following  notation  will  be  used.  We  define:  D  =  {1, . . . ,  n}, 
the  index  set  of  DMUs;  (Bp  is  the  set  of  basic  (nonbasic)  variables  associated  with 
an  optimal  basis  for  an  envelopment-form  DEIA  model  apphed  to  DMU  j  €  D;  s^, 
and  sp  are  the  values  of  6,  8°,  and  s*,  respectively,  in  Bp,  \j  =  {i  €  D|Ai  €.  Bp  ;  \i  is 
the  reduced  cost  of  weight  Aj  in  a  given  optimal  solution;  E*  =  e  =  1,  8°*  —  0, 
and  sp  —  0};  and  I*  =  D  —  E*. 

2. 1.2.1.  Choice  of  Model  Form 

For  each  DEIA  model  (CCR,  BCC,  Additive),  either  the  envelopment  or  the  mul¬ 
tiplier  forms  may  be  used  to  compute  solutions.  Since  typically  (s  -I-  m)  n,  there  will 
be  fewer  constraints  and  a  smaller  basis  inverse  to  be  maintained  with  the  envelopment 
form,  used  by  virtually  all  DEA  codes.  This  results  in  (n  +  s  -I-  m)  variables  and  (s  +  m) 
or  (s  +  m  +  1)  constraints. 

Since  the  number  and  difficulty  of  pivots  in  a  linear  program  grows  more  rapidly 
in  the  number  of  constraints  than  in  the  number  of  variables,  this  choice  of  the  envelop¬ 
ment  model  becomes  an  increasingly  favorable  one  as  the  size  of  the  data  set  expands. 
This  advantage  can  be  further  exploited  as  the  analysis  reveals  the  memberships  of  E* 
andr. 

2.1. 2.2.  Early  Identification  of  Efficient  DMUs 

It  has  been  observed  in  [6,33)  that  if  i  €  D  and  i  G  Bp  j  €  D,  then  DMUj  is 
efficient.  This  is  shown  formally,  as  follows. 

Theorem  1.  For  any  j  G  D,  Aj  C  E*  . 

Proof.  (CCR  version)  Let  the  optimal  solution  to  the  primal  (envelopment)  problem 
have  a  set  of  basic  variables  B*.  As  shown  in  [41],  B*  must  contain  at  least  one  \j  >  0, 
j  G  D.  If  Xj  >  0,  then,  by  complementary  slackness,  the  corresponding  constraint  of  the 
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dual  problem  must  be  strictly  satisfied  and  /iVj  —  uXj  =  0.  Hence  ^YjfuXj  =  1,  and  a 
multiplier  set  has  been  foimd  for  which  the  dual  LP  associated  with  DMUj  would  result 
in  z  =  1,  the  maximum  attainable.  Hence,  DMUj  is  efiicient.  I 

This  theorem  holds  for  all  standard  DBA  models  and  has  many  consequences 
for  computational  efficiency  in  data  envelopment  analysis.  The  early  identification  of 
members  of  E  C  E*  permits:  (1)  bypassing  unsolved  subproblems  associated  with 
members  of  E,  if  values  of  the  weights  and  slacks  are  not  needed;  (2)  near-optimal 
advanced  starting  solutions  for  known  efficient  DMUs;  and  (3)  pricing  strategies  that 
give  priority  to  members  of  E.  The  use  of  observation  1  can  reduce  the  number  of  linear 
programming  subproblems  to  be  solved  in  the  analysis,  while  the  observations  2  and  3 
can  expedite  the  solution  of  subproblems  that  must  be  optimized. 

In  software  implementations,  the  status  of  each  DMU^,  and  its  associated  Xj,  can 
be  easily  maintained  as:  member  of  E  C  E*,  member  of  I  C  I*,  or  unknown  (member 
of  U  =  D  -  E  -  I).  When  the  subproblem  associated  with  DMUj  is  solved,  j  can  be 
added  to  E  or  1.  In  addition,  from  Theorem  1,  may  reveal  new  members  of  E — from 
the  reference  set,  if  DMUj  is  inefficient,  or  from  degenerate  basic  variables,  if  DMUj  is 
efficient. 

Because  of  the  computational  value  of  early  status  identification,  the  following 
corollary  is  also  useful  for  streamlining  the  solution  of  DEA  problems. 

Corollary  1.  If,  at  optimality,  A,  =  0,i  €  D,  then  i  G  E*. 

Proof.  Let  z*  be  the  optimal  solution  to  the  DEA  finear  programming  problem  for 
DMUj.  Let  z*  =  z*  —  where  N  is  the  current  set  of  indices  of  nonbasic 

variables.  Let  t  €  N  with  Ai  =  0.  By  letting  A*  enter  the  basis,  z*  does  not  change  in 
value.  By  Theorem  1,  DMU*  must  be  an  efficient  DMU.  I 
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This  corollary  supports  as  extended  search  for  members  of  E  associated  with 
the  nonbasic  variables.  Our  experience  is  that  there  are  often  a  substantial  number  of 
alternate  optimal  solutions  and  nonbasic  variables  with  zero  reduced  costs. 

2.1. 2.3.  Restricted  Basis  Entry 

Another  consequence  of  Theorem  1  is  that,  since  any  Xj  associated  with  an 
optimal  basis  corresponds  to  an  efficient  DMUj,  we  need  only  consider  those  variables 
associated  with  D— I  when  pricing.  This  was  publically  reported  by  the  authors  in  [18,19] 
but  discovered  independently  by  Ali  and  published  in  (6|.  This  computationally  powerful 
finding  permits  continual  restriction  of  the  pricing  operation;  as  inefficient  DMUs  are 
identified,  their  corresponding  decision  variables  are  dropped  from  subsequent  problems. 

This  narrowing  of  focus  to  the  D  —  I  set  has  a  ratchet-like  effect  on  the  size 
of  the  linear  programs  being  solved.  As  the  membership  of  I  increases,  the  effective 
sizes  of  unsolved  subproblems  (in  terms  of  number  of  variables)  become  progressively 
smaller  as  well.  Hence  the  “later”  subproblems  in  a  series  tend  to  solve  more  quickly 
than  the  “earlier”  ones.  As  will  be  shown,  this  has  a  dramatic  impact  on  the  solution 
effort  required,  especially  on  large-scale  problems. 

2. 1.2.4.  Candidate  List 

Candidate  lists,  and  other  multi-pricing  schemes,  are  standard  computational 
procedures  for  expediting  the  solution  of  linear  programs  [64,67,71,72,74].  These  basically 
involve  pricing  a  set  of  nonbasic  variables  to  form  a  short  list  of  promising  candidates  for 
entry  into  the  solution,  and  repeatedly  selecting  incoming  variables  from  the  list  (re¬ 
pricing  members  between  pivots)  before  replenishing  with  fi’esh  candidates. 
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This  mechanism  may  be  specialized  to  the  structure  of  data  envelopment  analyses 
as  well.  First,  the  restricted  basis  entry  procedure  eliminates  from  consideration  variables 
associated  with  I.  Further,  priority  for  hst  inclusion  can  be  given  to  members  of  E  over 
U,  since  any  basis  will  contain  only  slacks  and  members  of  E*  (6J. 

2. 1.2.5.  Degeneracy  and  Anti-Cvcling  Logic 

Note  that  if  DMUj  is  efficient,  the  optimal  basic  solution  for  the  envelopment 
model  can  be  comprised  of  6  =  Aj  =  1,  with  all  other  variables  equal  to  zero.  Ac¬ 
companying  such  highly  degenerate  solutions  is  a  greater  likelihood  of  degenerate  pivots, 
as  when  the  minimum  ratio  is  zero.  When  such  degeneracy  exists,  so  does  the  possi¬ 
bility  of  cycling,  or  a  repetition  of  bases  for  the  same  extreme  point  solution  leading 
to  non-convergence.  Although  practitioners  often  dismiss  this  possibiUty  as  remote,  our 
experience  indicates  that  cycling  in  DEA  codes  is  not  only  prevalent  but  likely,  in  the 
absence  of  anti-cycling  procedtires. 

Various  methods  have  been  proposed  for  the  avoidance  of  cycling  within  the 
simplex  method  for  linear  programming.  The  leading  approaches — lexicographic  ordering 
[30],  perturbation  [30],  and  Bland’s  rule  [26] — can  be  computationally  demanding  and 
hence  are  usually  invoked  only  imder  certain  conditions.  Some  researchers  support 
the  use  of  lexicographic  ordering  in  order  to  avoid  a  temporary  stalling  of  the  solution 
process,  noting  a  substantial  decrease  in  the  number  of  pivots  when  employed  [46].  Ali 
[6]  proposes  a  specialized  ratio  test  that  gives  preference  to  A  variables  over  slacks  when 
breaking  ties  for  zero  minimum  ratios,  and  yields  decreases  in  the  munber  of  pivots  and 
solution  times  on  reported  test  problems. 

Om  experience  indicates  that,  for  many  problems,  simple  scaling  of  the  problem 
data — ^by  variables  and  by  constraints,  normalizing  by  averages  ]25] — is  most  effective 
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in  lowering  the  incidence  of  stalling  and  cycling  |50,64,83].  In  those  cases  where  a  lack 
of  progress  is  detected,  the  invocation  of  the  lexicographic  ordering  procedure  quickly 
remedies  the  situation. 

2.1. 2.6.  Non-Arr.himfldefl.n  InfinitasimaJ.  f 

The  non-Archimedean  infinitesimal,  £,  in  the  objective  functions  of  envelopment- 
form  models  was  represented  in  early  DEIA  codes  using  relatively  small  positive  values 
(e.g.,  10~®).  Unfortimately,  the  effect  for  CCR  and  BCC  models  was  to  make  the 
results  of  any  analysis  dependent  on  the  value  chosen,  leading  to  incorrect  results  in 
many  cases  [9]. 

More  recently,  algorithmic  changes  have  been  proposed  to  avoid  such  data  depen¬ 
dencies  and  ensure  correctness  of  envelopment  analyses.  Specifically,  the  CCR  and  BCC 
objective  functions  are  recast  in  a  preemptive  priority  form:  minimize  P\d  -I-  /^(— Is*  — 
ls°).  Proposed  implementation  schemes  include  two-stage  optimization  [3),  and  a  multi¬ 
objective  pricing  form  of  the  simplex  method  [46].  This  approach  appears  to  have  re¬ 
solved  the  infinitesimal  issue  in  the  DEA  commimity. 

2. 1.2.7.  Preprocessing  of  Observations 

An  analysis  of  the  DMUs’  observations  can  lead  to  identification  of  members  of 
I  and  E  prior  to  the  application  of  optimization.  Also,  in  some  instances,  the  Ukelihood 
of  encoimtering  efficient  or  inefficient  imits  early  in  the  solution  process  can  be  increased 
when  DMUs  are  processed  in  a  particular  order. 

As  shown  in  [6],  DMUi  is  inefficient  if  it  is  dominated  by  any  DMUj,  that  is  if 
Xj  <  Xi  and  Vj  >  Yi.  In  the  BCC  and  additive  models,  DMUj  is  efficient  if  one  of 
its  outputs  (inputs)  has  a  unique  maximum  (minimum)  value,  or  it  has  the  largest  ratio 
Oj  =  lYj/lXj. 
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Processing  DMUs  in  Oj  order  is  reported  to  identify  members  of  E*  earlier  than 
random  order.  This  may  be  helpful,  especially  when  the  number  of  efficient  DMUs  is 
small,  but  it  is  unclear  whether  it  is  generally  preferable  to  solve  subproblems  associ¬ 
ated  with  E*  or  I*  in  the  early  part  of  the  process,  since  solutions  to  inefficient  DMU 
subproblems  tend  to  reveal  more  members  of  E*  than  the  solutions  to  efficient  DMUs, 
and  subproblems  for  members  of  E*  tend  to  be  more  degenerate  and  are  more  prone  to 
cycling. 

2. 1.2.8.  AHvancpd  Starting  Bases  and 
Reoptimization 

In  many  algorithms  that  involve  the  solution  of  a  series  of  related  linear  programs, 
or  subproblems,  the  use  of  reoptimization  can  lead  to  substantial  time  savings.  This  is 
accomphshed  by  using  B*  as  an  initial  basic  feasible  solution  for  the  LP  associated  with 
DMUj,  as  is  possible  when  subproblems  differ  by  one  or  more  parameters  (objective 
function  or  right-hand-side  values)  but  otherwise  have  the  same  mathematical  structure. 

DEA  subproblems  can  employ  reoptimization,  but  its  effectiveness  is  dependent 
on  the  mathematical  “closeness”  of  the  two  solution  points,  B*  and  B^.  Ali  [7]  suggests 
the  use  of  this  technique  only  for  j  €  I*. 

2.2.  PIONEER  Code 

We  have  developed  and  tested  PIONEER,  a  new  DEA  code  which  embodies  most 
of  the  above  efficiency  techniques.  The  objective  was  to  build  a  flexible  research  code  for 
testing  and  evaluating  alternative  solution  approaches  to  large-scale  problems. 

2.2.1.  Implementation  Overview 

pioneer’s  optimization  kernel  is  based  on  the  XMP  library,  a  collection  of 
portable,  reliable,  and  widely  used  Fortran  subroutines  for  the  solution  of  linear  pro¬ 
grams  using  the  primal  and  dual  simplex  methods  [70].  The  LP  basis  is  maintained 
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in  LU-factorized  form  with  pivoting  routines  from  the  Harwell  library.  The  availability 
of  source  code  permitted  customization  of  algorithmic  steps  for  DEA  problems,  and 
the  clean,  documented,  modularized  design  of  the  routines  provided  a  flexibility  that 
supported  and  simplifled  experimentation. 

Anti-cycling  logic  was  added  to  the  XMP  ratio  test.  While  the  aforementioned 
data  scaling  procedture  substantially  reduced  the  incidence  of  stalhng,  lexicographic  or¬ 
dering  is  invoked  when  lack  of  progress  is  detected. 

The  PIONEER  code  implements  all  varieties  of  DEIA  models  described  in  [81]: 
CCR,  BCG,  and  additive  with  both  input  and  output  orientations  and  two  variable- 
retums-to-scale  models  allowing  for  either  increasing  or  decreasing  returns  to  scale.  All 
of  the  efficiency  techniques  in  Section  1.2  are  employed,  except  for  preprocessing  and 
reoptimization.  (Our  testing  showed  that  reoptimization  was  of  uneven  value — sometimes 
reducing,  sometimes  increasing,  run  times — hence  was  not  used  in  the  final  testing.)  The 
candidate  list  is  not  prioritized,  and  the  two-stage  optimization  approach  to  the  non- 
Archimedean  infinitesimal  issue  is  employed. 

Auxiliary  data  structmes  maintain  the  E,  I,  or  U  status  of  each  DMU,  as  indicated 
by  the  following  outline  of  the  code’s  logic: 

Procedure  PIONEER 

1.  Initialize  U  =  D,  E  =  0, 1  =  0. 

2.  While  U  ^  0  : 

a.  Select  i  €  U. 

b.  Solve  the  subproblem  for  DMU*. 

c.  If  DMUj  is  efficient,  E  =  E  U  {i},  else  I  =  I U  {i}. 

d.  Update:  E  =  E  U  A<  U  {j|Aj  =  0},  and  U  =  D  —  E  —  I. 

The  strength  of  step  2d  depends  on  Theorem  1,  Corollary  1,  and  the  distribution  of 
observations  in  relative  to  the  efficient  frontier. 


37 


It  should  be  noted  that  while  the  XMP  data  structures  are  designed  for  sparse 
problems,  DEA  subproblems  are  almost  totally  dense.  This  results  in  an  uimecessary 
processing  overhead  in  the  current  implementation  that  can  be  eliminated  with  recoding. 
Hence  the  execution  times  reported  in  this  paper  should  be  considered  worst  case,  or 
at  least  conservative,  measures  of  performance,  relative  to  a  fully  optimized  and  tuned 
implementation. 


2.2.2.  Baseline  Computational  Testing 

To  evaluate  the  efficiency  of  the  PIONEER  research  code,  and  to  provide  a 
set  of  baseline  measurements  from  which  to  compare  algorithmic  and  implementational 
enhancements,  a  series  of  test  runs  were  made  on  medium-  and  lau-ge-scale  problems. 
Testing  with  and  without  individual  efficiency  techniques  lent  insight  into  their  impact 
on  solution  speed. 

2.2.2. 1.  Test  Data 

Although  the  code  was  validated  for  solution  accuracy  on  small  problems  from 
the  open  literature,  of  primary  interest  was  its  effectiveness  on  large-scale  problem  sets. 
For  testing  purposes,  real  life  data,  randomly  generated  data,  and  data  based  on  the 
economic  Cobb-Douglas  functional  form  were  employed. 

First,  the  Federal  Reserve  Bank  of  Dallas  provided  a  data  set  of  8,742  banks 
from  its  Southwest  district,  with  variables  consisting  of  the  six  inputs  and  three  outputs 
described  in  [20,21].  This  challenging  problem  from  industry  was  clearly  beyond  the 
state  of  the  art  of  DEIA  codes,  and  was  a  prime  motivator  for  this  research.  Testing  was 
performed  on  smaller  subsets  by  selecting  the  first  n  observations  from  the  unordered  file. 

Randomly  generated  observations  from  a  multinormal  population  provided  a  sec¬ 
ond  source  of  test  data.  Using  the  DRNMVN  routine  from  the  IMSL  library  and  the 
procedures  given  in  Appendix  A,  large  data  sets  could  be  easily  created.  The  variables 
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were  partitioned  into  3  inputs  and  4  outputs  so  as  to  observe  positive  correlations  be¬ 
tween  the  “input”  and  “output”  sets. 

Finally,  a  random  problem  generator,  DEA-GEN,  was  written  to  create  obser¬ 
vations  based  on  the  classic  Cobb-Douglas  form  of  production  surfaces.  As  detailed  in 
Appendix  B,  the  program  gives  the  user  a  measure  of  control  over  the  proportion  of 
efficient  points,  and  creates  data  sets  that  more  closely  approximate  realistic  economic 
processes  than  the  multinormal  generator. 

2.2. 2.2.  Test  Environment 

The  PIONEER  code  was  tested  on  Southern  Methodist  University’s  Sequent 
Symmetry  S81B  with  32MB  of  internal  storage  and  processing  units  consisting  of  16-MHz 
Intel  80386s  with  Weitek  coprocessors.  The  software  is  written  entirely  in  Fortran  and 
executed  imder  Dynix  3.0.12,  a  BSD-Unix-based  operating  system.  While  the  processors 
are  rated  at  4  million  operations  per  second,  in  terms  of  current  technology  they  are 
equivalent  to  relatively  slow  personal  computers. 

2.2.2.3.  Experimental  Results 

The  PIONEER  code  was  apphed  to  problems  from  each  of  the  three  sources.  The 
solution  times  reported  below  are  “wall-dock”  times,  or  elapsed  real  execution  times, 
exclusive  of  problem  input  and  output  on  the  Sequent  Symmetry.  Unsolved  subproblems 
associated  with  members  of  E  (as  determined  by  Step  2d)  were  bypassed. 

Table  2.1  describes  test  set  A,  which  was  made  up  of  banking,  multinormal,  and 
DEA-GEN  problems.  Three  problems  from  Federal  Reserve  banking  data  were  examined: 
the  first  1,000,  the  first  2,000,  and  the  first  8,000  DMUs  of  the  8,742-bank  data  set.  Two 
multinormal  data  sets,  for  n  =  1,000  and  n  =  2,000  were  generated  using  the  variance- 
covariance  matrix  and  procedures  given  in  Appendix  B.  Six  problems,  with  n  =  1,000 
and  n  =  2,000,  were  created  using  DEA-GEN  and  the  parameters  given  in  Appendix  C. 
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Table  2.1.--  PIONEER  solution  times  on  test  set  A 


Source 

s 

m 

DMUs 

No  RBE  Time 

RBE  Time 

Ratio 

FR  Bank 

3 

6 

33.29  min 

17.04  min 

FR  Bank 

3 

6 

2000 

179.91  min 

95.59  min 

0.532 

FR  Bank 

3 

6 

8000 

44.21  hour 

19.80  hour 

0.448 

Multinormal 

4 

6 

1000 

31.96  min 

18.72  min 

0.586 

Multinormal 

4 

6 

2000 

106.90  min 

61.65  min 

0.577 

DEA-GENa 

4 

6 

1000 

57.51  min 

33.13  min 

0.576 

DEA-GENa 

5 

3 

2000 

130.02  min 

75.94  min 

0.584 

DEA-GENb 

7 

4 

1000 

61.49  min 

37.96  min 

0.617 

DEA-GENb 

7 

2 

2000 

151.27  min 

92.42  min 

0.611 

DEA-GENc 

6 

6 

1000 

72.83  min 

43.29  min 

0.594 

DEA-GENc 

6 

5 

2000 

213.61  min 

126.50  min 

0.592 

Average  0.567 


Solution  times  for  these  problems  with  the  PIONEER  code  and  the  OCR  model  are  given 
in  Table  2.1. 

The  code  was  run  using  the  CCR°  model  with  and  without  the  use  of  restricted 
basis  entry  (RBE)  and  early  identification  of  eflBcient  units  (EIE)  to  examine  its  impact 
on  solution  time,  as  shown  in  Table  2.1.  In  all  cases,  the  RBE  procedure  had  a  strong 
impact,  cutting  solution  times  roughly  in  half.  The  17.04-minute  time  for  the  1,000- 
DMU  problem  indicated  that  the  PIONEER  code  is  reasonably  eflBcient  for  medium¬ 
sized  problems.  But,  even  with  the  help  of  RBE,  the  19.8-hour  solution  time  for  the 
8,000-DMU  problem  is  excessive  for  practical  usage. 

A  closer  examination  of  the  8000-DMU  bank  problem  (BANK-8)  solution  process 
gives  insight  into  the  sources  of  the  speed  improvements.  Figure  2.1  gives  the  time  to 
solve  each  set  of  1,000  subproblems  in  the  8,000  total,  both  with  and  without  RBE  logic. 
Note  that  when  RBE  is  not  used,  the  time  to  solve  each  set  of  1,000  subproblems  is 
roughly  the  same  (aroimd  4.1  hours).  But  when  RBE  is  employed,  the  last  group  of 
1,000  subproblems  is  solved  almost  four  times  faster  than  the  first  group.  By  restricting 


40 


the  known  inefficient  DMUs  from  entering  the  basis,  the  later  subproblems  are  smaller, 
and  easier  to  solve.  In  addition,  the  early  identification  of  efficient  DMUs  results  in  fewer 
subproblems  to  be  solved.  Typically,  fewer  than  20%  of  all  DMUs  are  efficient,  so  the 
faster  solution  time  can  be  attributed  mainly  to  restricted  basis  entry. 
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Figure  2.1.  Solution  time  for  subsets  of  BANK-8 


Figure  2.2  shows  the  cumulative  effect  of  RBE/EIE  on  solution  time  for  the  8,000- 
DMU  bank  problem.  While  the  performance  improvement  is  less  pronounced  in  the 
earlier  subproblem  groups,  the  2:1  ratio  becomes  evident  in  the  last  few  groups,  and 
trends  indicate  an  even  greater  disparity  might  result  for  larger  problem  sets. 

Although  PIONEER’S  solution  times  are  encouraging  and  indicate  that  the  code 
may  be  comparable  to  others  described  in  the  literature,  other  performance  improve¬ 
ments  are  possible.  The  next  section  describes  the  use  of  parallel  processing  to  further 
decrease  solution  times  for  these  and  other  problems. 
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Figure  2.2.  Cumulative  solution  time  for  subsets  of  BANK-8 


2.3.  Parallel  PIONEER  Code 

Because  of  the  nature  of  the  DEA  formulation,  an  advance  in  computer  technol¬ 
ogy,  parallel  processing,  can  be  utihzed  to  achieve  dramatic  reductions  in  solution  times. 
With  parailel  processing,  the  solution  for  a  given  problem  is  found  by  simultaneously 
coordinating  the  efforts  of  multiple  processors  to  accomplish  a  given  task  on  a  common 
body  of  data.  If  the  algorithm  associated  with  the  problem  solution  process  consists  of 
multiple  independent  tasks  that  can  be  properly  assigned  to  separate  processors,  dra¬ 
matic  reduction  in  solution  times  may  be  possible. 

Of  the  numerous  varieties  of  parallel  machine  architectures,  the  most  prevalent 
commercial  design  is  miUtiple-instruction,  multiple-data  (MIMD)  [19,56].  Parallel  com¬ 
puter  systems  are  composed  of  multiple  processors  which  have  the  ability  to  act  inde¬ 
pendently  on  problem  data  sets.  To  arrive  at  a  correct  solution,  coordination  of  all 
processors  and  their  tasks  is  necessary.  Two  basic  techniques  are  used  to  accomplish 
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this  coordination  activity  in  the  parallel  environment.  The  processors  can  communicate 
through  shared  memory.  If  the  time  to  access  a  particular  memory  location  is  identical 
for  all  of  the  processors,  the  system  is  called  tightly  coupled,  or  else,  the  system  is  termed 
loosely  coupled.  If  the  parallel  machine  contains  no  shared  memory  capabilities,  coordi¬ 
nation  of  processor  activities  must  be  accomplished  through  message  passing  between  all 
of  the  processors. 

As  with  traditional  single-processor  (serial)  machines,  solution  efficiencies  are 
directly  tied  to  how  well  the  algorithmic  steps  match  the  architecture  of  the  underlying 
machine.  Because  a  DBA  problem  involves  the  optimization  of  many  separate  linear- 
programming  subproblems,  the  use  of  MIMD-style  parallehsm  to  speed  solution  appears, 
on  the  surface,  to  be  a  “natural”  one.  In  fact,  the  mapping  of  the  DBA  solution  process 
to  a  tightly  coupled  MIMD  architecture  turns  out  to  be  an  ideal  example  of  the  use  of 
parallel  processing. 


2.3.1.  Parallel  Code  Design 

The  apphcation  of  parallel  processing  to  DBA  problems  was  first  reported  in 
PhilUps,  Parsons,  Donoho  [76],  where  four  transputers  were  run  from  a  Macintosh  Ilex 
on  a  54-DMU  problem.  Times  were  reduced  by  a  factor  of  three  in  this  loosely  coupled 
MIMD  implementation.  The  next  section  describes  a  very  diflFerent  computing  environ¬ 
ment  and  how  the  PIONBBR  code  was  modified  to  use  this  form  of  parallehsm. 

2.3. 1.1.  Target  Machine  Bnvironment 

As  with  software  designed  for  vector  processors,  parallel  codes  must  be  structured 
to  match  the  architecture  of  the  target  machine.  Our  test  machine  was  the  same  Sequent 
Symmetry  S81B  that  was  employed  for  serial  testing,  but  which  can  be  programmed  for 
parallel  processing.  The  system  has  a  tightly  coupled  MIMD  design,  with  20  16-MHz 
80386  processors,  Weitek  coprocessors,  and  32MB  of  sharable  memory. 
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The  Sequent’s  operating  system  permits  the  programmer  to  create  multiple,  in¬ 
dependent  processes  which  it  schedules  on  the  available  processors.  The  processes  can 
have  both  private  and  shared  data-spaces.  The  shared  spaces  can  be  read  and  written  by 
all  designated  processes,  and  can  be  used  for  interprocess  communication  and  to  avoid 
duplication  of  common  data. 

2.3. 1.2.  Parallelization  of  the  PEA  Algorithm 

This  type  of  parallel  machine  is  designed  to  be  most  effective  with  work  that 
can  be  decomposed  into  large,  independent  tasks.  The  DEA  solution  process  can  be 
organized  in  this  manner  through  the  use  of  data  partitioning,  where  multiple  identical 
processes  apply  the  same  program  steps  to  different  data.  By  considering  each  DMU’s  LP 
subproblem  to  be  a  separate  task,  such  large-granularity  work  decomposition  is  possible. 

In  the  parallel  PIONEER  code,  a  self-scheduling  approach  is  used,  where  pro¬ 
cesses  select  and  execute  tasks  from  a  shared  work  list,  on  a  first-come-first-served  basis. 
Although  incurring  a  minor  amount  of  coordination  overhead,  such  self-scheduling  per¬ 
mits  a  balanced  distribution  of  the  workload  across  processes,  an  important  characteristic 
when  individual  task  times  vary. 

Each  process  solves  its  LP  subproblems  in  its  private  memory;  shared  memory 
stores  the  original  problem  data  and  the  status  of  each  DMU.  Since  a  DMU’s  status — in 
terms  of  membership  in  E,  I,  or  U — may  change,  restricted  basis  entry  becomes  dynamic 
and  time-based.  At  one  moment,  a  given  Xj  variable  may  be  part  of  the  E  U  U  pricing  set 
and,  an  instant  later,  be  found  to  be  inefficient  and  ineligible  for  basis  entry.  The  shared 
status  array  automatically  conununicates  this  information  to  aU  processes.  This  is  an 
instance  of  a  race  condition,  or  timing-dependent  code,  which  can  result  in  stochastic 
solution  statistics  when  the  order  of  events  differ  from  nm  to  run  due  to  minute  timing 
differences. 
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2.3.2.  Testing  Parallel  PIONEER 


To  test  the  parallel  implementation  of  the  PIONEER  code,  a  variety  of  problems 
were  chosen  for  analysis.  Table  2.5  describes  the  characteristics  of  the  problems  chosen. 
The  original  bank  data  consisted  of  8,742  DMUs.  For  each  of  the  bank  problems,  the 
DM  Us  composing  the  data  set  were  randomly  chosen  from  the  original  data  set.  The 
multi-normal  and  DEA-GEN  data  sets  are  described  in  Appendices  A  and  B.  These 
problems  were  generated  to  simulate  real  life  data  of  large-scale  DEA  problems.  The 
CCRj  model  was  used  for  all  runs. 

The  “wall  clock”  time,  measured  in  minutes,  to  solve  each  DEA  problem,  exclu¬ 
sive  of  input  output  times,  are  given  in  Table  2.2.  For  the  problems  consisting  of  8,000 
and  4,000  DMUs,  the  niunber  of  processors  used  was  limited  by  the  available  memory. 
Tables  2.3  and  2.4  reveal  the  speedup  and  efficiency  of  the  parallel  approach.  In  all 
cases,  relative  speedup  was  nearly  linear.  Isolated  examples  indicate  that  superlinear 
speedup  may  be  possible  because  of  the  use  of  RBE/EIE  in  the  PIONEER  code.  Since 
the  PIONEER  code  is  asynchronous,  i.e.,  each  LP  can  be  solved  independently  of  all 
other  LPs,  the  parallelization  is  highly  effective  at  improving  solution  times. 


Table  2.2.  -  Test  problems  parallel  run  times  (min.) 


#  Procs 

Bank 

1000  4000  8000 

Multi-Normal 

4000 

DEA-GENa 

2000 

DEA-GENb 

2000 

DEA-GENc 

2000 

1 

17.04  283.22  1189.81 

242.96 

75.94 

92.42 

126.50 

2 

8.70  151.23  589.22 

122.35 

46.52 

63.63 

3 

5.68  101.42  389.46 

81.81 

25.58 

31.06 

42.48 

4 

4.27  73.26 

61.39 

19.19 

23.39 

31.91 

5 

3.40  61.27 

49.14 

15.35 

18.71 

25.57 

6 

2.86 

12.82 

15.60 

21.32 

2.45 

11.02 

13.38 

18.28 

2.15 

9.65 

11.74 

16.02 

1.96 

8.57 

10.45 

14.26 

10 

1.72 

7.72 

9.40 

12.80 

15 

1.17 

5.19 

6.33 

8.61 
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Table  2.3.  —  Test  problems  parallel  speedups 


#  Procs 

mBui 

Multi-Normal 

4000 

inaRjmDIj^ 

1 

1.00  1.00  1.00 

1.00 

1.00 

1.00 

2 

1.96  1.87  2.02 

1.99 

1.99 

1.99 

1.99 

3 

3.00  2.79  3.06 

2.97 

2.97 

2.98 

2.98 

4 

3.99  3.87 

3.96 

3.96 

3.95 

3.96 

5 

5.01  4.62 

4.94 

4.95 

4.94 

4.95 

6 

5.96 

5.92 

5.92 

5.93 

6.96 

6.89 

6.91 

6.92 

7.93 

7.87 

7.87 

7.90 

8.69 

8.86 

8.84 

8.87 

10 

9.91 

9.84 

9.83 

9.88 

15 

14.56 

14.63 

14.60 

14.69 

Table  2.4.  -  Test  problems  parallel  efficiencies 


#  Procs 

Bank 

1000  4000  8000 

Multi-Normal 

4000 

jgii 

DEA-GENb 

2000 

DEA-GENc 

2000 

1.000  1.000  1.000 

1.000 

1.000 

0.979  0.936  0.979 

0.994 

0.994 

1.000  0.931  1.000 

BUH 

0.993 

0.998  0.966 

0.989 

0.991 

5 

1.002  0.924 

0.989 

0.989 

6 

0.993 

■■■ 

0.987 

BB 

0.989 

0.994 

0.984 

0.989 

0.991 

0.984 

0.984 

0.987 

0.966 

0.985 

0.983 

0.986 

10 

0.991 

0.984 

0.983 

0.988 

15 

0.971 

0.975 

0.973 

0.979 
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2.3.3.  Limits  of  Parallelism 


The  computational  testing  indicated  that  the  parallel  PIONEER  code  was  a 
highly  scalable  MIMD  apphcation  that  fully  utilized  the  multiprocessing  capabiUty  of 
the  given  computer  system.  We  feel  that  the  software  would  also  exhibit  much  of  the 
same  efficiency  if  implemented  in  a  loosely  coupled  MIMD  enviroiunent.  In  the  latter 
setting,  changes  in  a  DMU’s  status  would  have  to  be  broadcast  to  all  processors,  thus 
incurring  additional  overhead  and  a  latency  in  commumcating  the  information.  While 
unnecessary  work  (i.e.,  avoidable  in  a  shared-memory  environment)  might  result,  the  use 
of  a  larger  number  of  processors  could  more  than  offset  this  disadvantage. 

In  our  testing,  memory  size  limited  the  dimensions  of  problems  that  could  use  the 
full  parallelism  of  this  system.  We  beUeve  that  additional  internal  storage  would  permit 
the  same  excellent  speedups  on  the  larger  problems  as  was  observed  on  the  smaller  ones. 
(This  notion  was  verified  by  prehminary  testing  on  a  larger  system.) 

Even  with  these  encouraging  results,  we  felt  that  further  improvements  were 
needed  and  possible.  In  fact,  a  close  examination  of  the  parallel  solution  statistics  led 
to  a  new  procedure  which  further  reduced  all  times — both  serial  and  parallel — by  up  to 
an  order  of  magnitude. 


2.4.  Hierarchical  Decomposition 

Experimentation  with  sets  of  problems  that  were  identical  except  for  the  number- 
of  DMUs  yielded  solution  times  such  as  those  given  in  Figure  2.3.  Not  only  did  the 
memory  requirements  of  larger  problems  fimit  the  amount  of  usable  paralleUsm,  but 
run  times  grew  exponentially  in  n.  Hence  if  a  larger  problem  could  be  decomposed 
into  a  series  of  smaller  ones,  lower  memory  requirements,  greater  parallelism,  and  faster 
individual  solution  times  might  offset  any  additional  work  that  might  be  required. 
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1000  2000  3000  4000  5000  6000  7000  8000 

Number  of  DMUs 

Figure  2.3.  Solution  time  versus  problem  size 

2.4.1.  DEIA.  Decomposition  Background 
Consider  a  partitioning  of  the  set  of  DMUs  into  a  series  of  k  mutually  exclusive 
and  collectively  exhaustive  subsets  D<  c  D,  where  D  =  UieK  HieK  ~ 

K  =  k}.  Define  E(Dt)  and  I(D<)  to  be  the  index  sets  of  DMUs  in  D*  that  are 

eflBcient  and  inefficient,  respectively,  relative  to  Di  (i.e.,  D*  =  E(Dt)  U  I(Di)),  based  on 
the  meaning  of  efficiency  in  the  DEA  model  of  interest. 

Theorem  2.  if  D*  C  D,  then  I(Di)  C  I*  and  E*  C  E(Di). 

Proof  (CCR*  version).  Define  Zj{Di)  to  be  the  optimal  value  of  z  for  DMU  j  relative  to 
the  set  Dj.  We  know  that  Zj(D)  <  ■zJ(Dj),y7  €  D*  in  this  maximization  problem,  since 
there  are  more  model  constraints  associated  with  D  than  Di. 

Assertion  I(Di)  C  I*: 

DMU  j  is  inefficient  (efficient)  relative  to  set  S  if  2^(8)  <  1  (=  1).  Assume  j  €  D,  has 
Zj(Di)  <  1,  and  hence  is  inefficient.  Since  2^(D)  <  2^(Di),  j  will  be  inefficient  relative  to 
D. 
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Assertion  E*  C  E(Di): 

Since  E*  C  D  =  UigKD*,  then  E*  C  u*  (E(D»)  U  I(Dj)).  From  above,  E*  n  Uil(Dj)  =  0, 
therefore  the  assertion  must  be  true.  I 

Hence  if  a  DMU  is  inefficient  in  a  subproblem,  it  is  inefficient  for  the  full  prob¬ 
lem;  if  it  is  efficient  for  a  subproblem,  it  may  or  may  not  be  efficient  overall.  These 
relationships  provide  the  foimdation  for  the  following  decomposition  procedure  for  DEIA 
models. 


2.4.2.  The  Hierarchical  Decomposition  Procedure 
The  following  approach  to  DEIA.  problems  solves  a  series  of  subproblems  which  are 
arranged  in  a  hierarchy.  The  subproblems  are  themselves  DEA  problems  created  from 
subsets  of  the  DMUs  in  D.  They  can  be  solved  independently,  but  information  about  set 
I  membership  generated  from  the  subproblems  can  accelerate  the  solution  of  others. 

First  we  define  a  procedure  for  creating  and  solving  the  subproblems.  It  partitions 
the  set  of  DMUs  whose  efficiency  is  unknown  (U)  into  a  series  of  subsets,  or  blocks^  of 
size  6.  Recall  that  U  =  D  —  E  —  I  and  |U|  is  the  cardinality  of  U. 

Procedure  SolveBlocks(6, I): 

1.  Partition  U  into  k  =  flUl/ft]  mutually  exclusive,  approximately  equal-sized  blocks 
of  DMUs. 

2.  For  each  block  Bj,  i  €  K,  K  =  {1, . . . ,  k}: 

(a)  Apply  a  DEA  envelopment  model  to  compute  Ef  =  E(Bt),  using  early 
identification  of  efficiency  and  restricted  basis  entry. 

(b)  SetI  =  lUl(Bi). 

This  is  used  in  the  following  hierarchical  decomposition  procediue  (HDEA).  A 
graphical  representation  of  the  process  is  given  in  Figure  2.4. 
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Blockj  Block2  Block3 


Blocks 


Procedure  HDEA(6,a,/3): 

Level  1:  (Initial  pass) 

1.  I  =  0,U  =  D,£^  1. 

2.  SolveBlocks(6,£,I). 

Level  2:  (Identify  E*  and  P) 

while  (  U  7^  0  )  do: 

1.  £^e+  1. 

2.  |U|. 

3.  SoIveBlocks(6,  £,I). 

4.  if  lU|/u  >  a,6  t— |U|,  else  6  <— /?fc. 

Level  3:  (Resolution  ofV) 

Re-solve  the  DEA  model  (with  basis  entry  restricted  to  E*)  for  members  of  I*  to 
compute  correct  solution  values. 
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2.4.3.  Implementation  Considerations 

HDEA  is  a  divide-and-conquer  algorithm  similar  in  nature  to  merge  sorting  and 
sorting  networks  [49,66]:  the  original  problem  broken  into  severed  smaller  subproblems, 
which  are  solved  recursively  and  the  resvilts  ctrubined  to  solve  the  original  problem. 
The  blocksize  parameter,  6,  defines  sets  of  linear  programs  with  the  same  number  of 
constraints  as  the  original  problem,  but  many  fewer  variables.  This  results  in  lower 
memory  requirements  and  faster  solutions  for  the  subproblems,  although  more  linear 
programs  may  be  involved.  Because  of  this  increased  speed  and  Theorem  2,  these 
easier  problems  can  eliminate  inefficient-DMU  variables  earher  than  with  non-hierarchical 
approaches. 

The  HDEA  procedure  focuses  initially  on  isolation  of  E*  (in  levels  1  and  2), 
so  as  to  expedite  solution  of  the  subproblems  associated  with  I*  (in  level  3).  The 
method  should  be  particularly  effective  when  |E*|  |I*|,  as  is  typically  the  case. 

The  decomposition  into  subproblems  with  minimal  data  communication  requirements  is 
highly  attractive  from  a  parallel  processing  standpoint. 

Memory  requirements  are  a  function  of  max{6,  |E*|}.  If  primary  storage  is  at  a 
premium,  its  use  can  be  minimized  by  paging  in  DMU  data  from  external  storage  for 
each  subproblem  separately.  In  parallel  implementations,  participating  processes  might 
place  in  shared  memory  a  copy  of  the  X  and  Y  data,  plus  a  DMU-status  array;  each 
process  would  also  need  its  own  private  storage  for  solver-specific  information  such  as  a 
simplex  basis  inverse  and  candidate  list. 

The  choice  of  blocksize  also  affects  overall  solution  time.  Since  b  infiuences  the 
tradeoff  between  the  size  and  the  number  of  subproblems  solved,  computational  testing 
for  an  appropriate  setting  is  required.  Figure  2.5  shows  the  total  solution  time  in  minutes 
for  various  values  of  6  on  an  example  problem. 
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Figure  2.5.  Total  solution  time  vs.  blodcsize  for  BANK-8 


2.4.4.  Computational  Testing 

For  testing  the  HDEA  procedure,  real  life  data,  randomly  generated  data,  and 
data  based  on  the  economic  Cobb-Douglas  functional  form  were  employed.  For  each 
test  problem,  an  array  containing  the  status  of  each  DMU  was  maintained  in  shared 
memory.  A  private  copy  of  the  appropriate  data  was  given  to  each  processor  in  parallel 
so  that  each  DMU  could  be  solved  independently.  Since  the  status  of  each  DMU  could  be 
updated  without  conflict  from  other  processors,  no  locks  were  needed  during  the  status 
update  and  as  a  result,  the  parallel  implementation  is  pmely  asynchronous. 

All  times  reported  are  wall  clock  times,  in  minutes,  to  solve  the  problems  ex¬ 
clusive  of  any  input /output.  The  single  processor  times  represent  the  best  achieved 
serial  times  across  the  difiering-sized  DEA  problems.  The  parallel  case  is  a  direct  im¬ 
plementation  of  the  serial  fonnulation,  requiring  only  a  few  modifications  for  the  parallel 
environment  to  accompUsh  the  self-scheduling  parallel  implementation.  It  is  important 
to  note,  none  of  the  nms  were  made  in  a  dedicated  environment,  and  hence,  times  are 
subject  to  system  load.  However,  precautions  were  taken  to  conduct  the  runs  during  non¬ 
peak  horn’s  so  as  to  minimize  the  confoimding  of  outside  factors  on  the  solution  times. 
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For  the  test  problems  shown,  a  blocksize  of  250  was  used  for  the  8,000  and  4,000 
DMU  cases,  125  for  the  2,000  DMU  cases,  and  100  for  the  1,000  DMU  cases.  The  eflFects 
of  different  blocksizes  must  be  investigated,  hence,  these  results  should  not  be  viewed  as 
the  best  possible.  During  level  2  of  the  HDEA  procedure,  /?  =  1.5.  As  a  result,  each 
subsequent  block  at  level  2  grows  by  50%  until  all  DMUs  have  been  classified  as  either 
efficient  or  inefficient. 

Table  2.5  shows  that  for  all  test  runs  the  speedup  was  nearly  finear.  Larger  more 
difficiilt  problems  resulted  in  better  speedup  than  smaller  problems.  With  the  smaller 
problems,  the  overall  solution  time  in  parallel  is  so  small  that  the  overhead  of  maintaining 
the  self-scheduling  tasks,  as  well  as  possible  workload  imbalance  between  processors, 
becomes  apparent.  Consequently,  although  the  parallel  results  for  small  problems  are 
quite  good,  the  parallel  implementation  will  be  more  efficient  for  large,  difficult  problems. 

Table  2.5  also  gives  the  total  number  of  pivots  and  pricing  operations  executed  to 
find  the  DEA  scores  for  each  data  set.  These  numbers  vary  with  the  number  of  processors 
because  of  the  method  used  to  update  the  status  of  each  DMU.  As  expected,  problems 
with  a  greater  number  of  DMUs,  and  those  with  a  larger  density  of  efficient  DMUs, 
require  more  pivots  and  pricing  operations  and,  consequently,  take  longer  to  solve. 

Tables  2.6  through  2.10  contain  the  results  of  each  test  problem  for  each  level 
of  the  HDEA  procedure.  The  speedup  remains  relatively  consistent  across  all  problems 
across  all  levels  and  is  nearly  linear. 

The  effects  of  using  early  identification  of  efficient  DMUs  to  reduce  the  number  of 
linear  programming  problems  that  must  be  solved  are  shown  in  Tables  2.11  through  2.15. 
With  the  HDEA  procedure,  the  total  number  of  finear  programs  that  must  be  solved 
is  slightly  more  than  twice  the  original  number  of  DMUs  contained  in  each  data  set. 
However,  since  the  HDEA  finear  programs  are  smaller  than  other  DEA  methods,  overall 
performance  is  improved.  EIE  improves  performance  further  by  reducing  the  number  of 
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linear  programs  at  level  1  and  level  2  by  10-25%.  Ewly  identification  has  no  effect  at 
level  3  since  only  DEA  scores  for  inefficient  DMUs  are  found  at  that  level. 

Table  2.16  shows  the  dramatic  effect  of  the  hierarchical  procedure  on  improving 
solution  time  for  the  8,000  DMU  cases  over  the  best  DEA  approach  reported  to  date. 
Only  three  processors  are  used  in  the  parallel  implementation  because  of  memory  hmi- 
tations  of  the  non-hierarchical  procedure.  The  HDEA  procedure  results  in  a  6-  to  12- 
fold  increase  in  speed  over  the  non-hierarchical  procedure  when  both  cases  are  rim  with 
the  same  number  of  processors.  Because  the  HDEA  procedure  requires  less  memory, 
permitting  use  of  more  processors,  the  cases  with  15  processors  are  75  to  125  times  faster 
than  the  serial  non-hierarchical  procedure. 

Although  the  test  problems  were  limited  to  8,000  DMUs  because  of  the  availabil¬ 
ity  of  real  life  data,  the  HDEA  procedure  can  accommodate  much  larger  problems.  A 
Cobb-Douglas  test  case  (Appendix  B)  consisting  of  25,000  DMUs  with  3  inputs  and  3 
outputs  was  solved  using  15  processors  in  19.26  minutes.  Because  of  memory  limitations, 
it  was  not  possible  to  solve  this  problem  with  the  non-hierarchical  procedure. 

As  seen  with  the  test  problems,  the  HDEA  procedure  allows  for  the  solution  of 
large-scale  DEA  problems  much  faster  than  previously  reported  DEA  methods.  When 
coupled  with  a  parallel  environment,  the  HDEA  procedure  yields  solutions  to  problems 
in  a  matter  of  minutes  which  previously,  would  have  taken  days.  Additionally,  the 
HDEA  procedure  allows  for  the  solution  of  very  large-scale  DEA  problems  that  remain 
unsolvable  (in  a  practical  sense)  with  any  other  reported  approaches. 

2.4.5.  Further  Advantages  of  the  HDEA  Approach 
Accompanying  the  HDEA  method’s  advances  in  sheer  solution  speed  are  a  variety 
of  additional  computational  advantages  in  other  settings.  The  hierarchical  structure  can 
be  exploited  when  solving  multiple  DEA  models  and  further  streamlining  opportunities 
await  exploration. 
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2.4.5. 1.  Solving  Multiple  Models 


There  axe  numerous  instances  in  which  multiple  models  must  be  solved  for  each 
DMU  under  consideration.  The  HDEA  approach  can  be  easily  modified  and  extended  to 
efficiently  determine  all  of  the  required  values. 

•  To  solve  for  technical,  scale,  and  overall  efficiency  (inefficiency),  both  the  BCC 
and  CCR  models  must  be  solved  (85).  With  HDEA,  We  can  use  the  BCC  or 
ADD  model  through  level  2  to  find  EJ^^.  On  only  the  E^^,  set  we  can  apply  the 
CCR  model  formulation  to  find  the  E*cr  {Q  set  of  DMUs.  During  level  3, 
while  determining  the  inefficient  scores,  we  can  solve  the  values  for  IJcr-  Then, 
as  an  advanced  starting  basis,  we  can  allow  all  EJ^  DMUs  to  enter  the  basis  and 
solve  for  (Alternately,  we  could  solve  for  the  I^cc  values  first,  then  relax 
the  BCC  lA  =  1  constraint,  tighten  the  basis  entry  rules,  and  solve  for  the  Iccr 
scores).  Once  the  BCC  and  CCR  values  are  found,  we  can  rapidly  solve  for  the 
three  efficiency  values  desired.  Note  that  this  prevents  a  duphcation  of  effort  to 
find  the  inefficient  values. 

•  Byrnes,  Fare,  and  Grosskopf  [28j  argue  that  a  “no  increasing  returns  to  scale” 
(NIRS)  model  m\ist  be  solved  to  determine  if  the  DMU  (or  the  virtual  DMU  if 
inefficient)  is  in  a  region  of  increasing  or  decreasing  returns  to  scale.  Knowing 
this  may  help  indicate  the  direction  the  DMU  should  go  to  become  efficient, 
either  expand  operations  through  output  augmentation,  or  downsize  through 
input  contraction.  We  know  that  E^^  C  E*^^,  C  EJ^^.  Consequently,  the  NIRS 
strategy  could  be  added  to  the  strategy  above  to  find  these  scores. 

•  Besides  solving  for  BCC,  CCR,  NIRS  models,  we  can  also  apply  assurance  region 
restrictions  [84,85]  once  the  E*  for  each  model  is  found  at  level  2.  The  varioiis 
assurance  region  approaches  can  be  solved  at  level  2  then  extended  to  level  3.  In 
this  way,  a  series  of  models  can  be  solved  rapidly  without  duplication  of  effort. 
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We  know  EJr  Q  E*  for  any  of  the  above  models.  Note:  this  same  approach  can 
be  used  for  the  cone-ratio  model  of  Chames,  et  al  [34,35]. 

•  For  the  CCR  and  BCC  models,  there  is  an  associated  orientation,  either  input 
or  output.  The  ineflBciency  scores  will  vary  depending  on  the  orientation,  but 
the  and  E*cr  sets  will  remain  the  same  regardless  of  the  orientation.  Con¬ 
sequently,  if  both  orientations  are  needed,  then  only  the  level  3  values  must  be 
resolved  to  obtain  both. 

•  Some  versions  of  the  model  may  provide  faster  solutions  than  others  depending  on 
the  number  of  output  and  input  variables  that  are  used.  For  example,  if  the  num¬ 
ber  of  outputs  exceeds  the  number  of  inputs,  the  output-oriented  model  solves 
faster.  Also,  the  CCR°  model  has  an  initial  feasible  solution,  thus  bypassing 
Phase  I  of  the  simplex  method.  Since  the  output  model  may  solve  faster,  this  can 
be  used  through  level  2,  then  the  input  model  applied  in  level  3  calculations  if  the 
input  values  are  needed.  HDEA  provides  the  flexibility  to  incorporate  the  fastest 
model  to  identify  E*  through  level  2,  then  at  level  3  can  use  the  model  of  choice 
to  determine  the  inefficiency  scores. 

2.4.5.2.  Tmnlementation  Issues 

•  As  noted  above,  any  method  can  be  used  to  find  the  set  of  DMUs  belonging  to  E* 
and  I*  such  as  sorting,  preprocessing,  domination  theory,  etc.  These  can  possibly 
enhance  the  HDEA  procedure  to  expedite  the  level  2  and  level  3  efficiency  score 
values. 

•  Computational  concerns  have  arisen  over  the  two-phase  |4]  or  [46]  approaches, 
which  can  be  lessened  via  HDEA.  We  know  that  the  efficiency  scores  do  not  vary 
with  the  smn-of-slack  solutions.  So  in  HDEA,  there  is  no  need  to  solve  for  the 
sum  of  slacks  during  level  1  or  most  of  level  2.  Once  potential  E*  DMUs  are 
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identified,  the  sum  of  slacks  must  be  solved  only  for  these  DMUs  to  determine 
if  they  are  weakly  efficient.  If  there  are  any  weakly  efficient  DMUs,  they  are 
removed  from  E*.  During  level  3,  there  is  no  need  to  solve  for  the  sum  of  slacks  for 
the  inefficient  DMUs  since  a  proper  projection  to  the  efficient  surface  is  obtained 
since  no  weakly  efficient  DMUs  can  enter  the  basis.  Solving  for  the  sum  of  slacks 
for  the  inefficient  DMUs  may  simply  identify  possible  alternate  optimal  solutions 
that  offer  no  new  information  when  weakly  efficient  DMUs  are  not  in  the  basis. 

•  As  the  number  of  input /out  put  variables  grows,  the  hkelihood  of  cychng  also 
increases  because  of  severe  degeneracy  involved  with  the  solution  of  some  DMUs 
(especially  efficient  DMUs).  Anti-cychng  procedures  invoke  a  high  computational 
cost.  Also,  the  possibihty  of  cycling  can  increase  with  a  larger  number  of  DMUs 
(variables)  in  the  model.  HDEA  can  help  reduce  the  cost  of  cycling  by  main¬ 
taining  smaller  problems.  Additionally,  at  level  1  and  most  of  level  2  there  is 
no  need  to  invoke  the  anti-cycling  rules.  Any  DMUs  that  exhibit  cycling  can  be 
deferred.  Only  when  all  potential  members  of  E*  are  identified  at  level  2,  must 
the  anticycling  rules  be  invoked.  By  passing  over  the  DMUs  that  cycle,  the  basic 
variables  of  other  DMUs’  solutions  may  later  reveal  the  problematic  DMUs  as 
efficient.  In  this  maimer,  the  DMUs  will  not  need  to  be  solved  and  the  cost  of 
applying  anti-cycling  rules  is  reduced. 

2.4.5.3.  Exploiting  the  Levels 

•  As  we  progress  through  level  1  to  level  2  to  level  3,  information  at  each  level  can 
be  used  to  enhance  the  solutions  at  the  next  level.  DMUs  that  frequently  show 
up  in  efficient  reference  sets  can  be  given  priority  during  the  pricing  procedure 
to  enter  the  basis.  By  choosing  the  most  attractive  variables  to  enter  the  basis, 
the  number  of  pivots  to  solve  the  LPs  may  be  reduced.  This  may  also  reduce 
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the  potential  for  cycling.  This  also  indicates  the  possibihty  of  nsing  different 
heuristics  for  the  entering  and  leaving  variables  for  the  different  levels  of  the 
HDEA  process. 

•  One  advantage  of  the  HDEA  process  over  domination  and  other  preprocessing 
techniques  is  that  it  can  be  used  to  find  DEA  scores  for  categorical  variables 
and  window  analysis  enroute  to  determining  overall  DEIA  scores.  By  blocking 
appropriately,  the  efficient  DMUs  for  each  category  or  window  can  be  recorded 
along  the  way.  In  this  way,  solving  overadl  scores  as  well  as  those  within  particular 
categories  or  across  various  categories  can  be  accompfished  without  duphcation  of 
effort. 

2.4.5.4.  Parallel  Implementation 

•  The  HDEA  procedure  can  expand  the  parallel  approach  across  many  platforms. 
For  example,  a  SIMD  distributed  network  can  be  used  to  solve  each  block  of 
DMUs.  Since  the  blocks  are  smaller,  memory  requirements  are  smaller.  The 
advantage  of  updated  restricted  basis  entry  and  early  identification  of  DMU 
status  within  blocks  will  be  lost,  but  the  basic  process  will  still  lead  to  a  solution. 
The  basis  entry  and  identification  schemes  wiU  hold  between  levels. 

•  For  the  parallel  case,  Amdahl’s  law  assmnes  that  the  same  information  available 
for  the  single  processor  as  for  multiple  processors.  But  with  RBE  and  early 
identification,  the  interaction  between  processors  ''an  enhance  the  solution  process 
resulting  in  high  levels  of  efficiency  when  multiple  processors  are  used.  This  is  not 
an  HDEA-exclusive  advantage,  but  it  does  hold  for  the  HDEA  process. 
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2.5.  Conclusions 


We  have  described  a  new  hierarchical  decomposition  procedure  for  solving  DEA 
problems  that  advances  the  state  of  the  art  for  computational  data  envelopment  analysis. 
As  demonstrated  with  medium-  and  large-scale  test  sets,  this  approach  can  have  dramatic 
benefits  over  traditional  methods — in  both  single-processor  and  parallel  settings — and 
permits  enormous  problems  to  be  optimized  in  a  modest  amoimt  of  time.  The  ability  to 
routinely  solve  problems  with  thousands  of  DMUs  permits  researchers  and  practitioners 
to  be  more  ambitious  in  their  application  of  this  important  class  of  models  and,  we  hope, 
will  encourage  new  and  even  more  exciting  applications  of  DEA. 
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CHAPTER  III 


RANK  ORDERING  DMUS  USING  TIERED  DEA 

Data  envelopment  analysis  was  originally  developed  for  efficiency  measurement. 
The  primary  focus  to  date  has  been  on  the  development  of  a  set  of  DEA  models  to 
identify  the  efficient  and  inefficient  DMUs  under  different  sets  of  assumptions  designed 
to  measure  various  types  of  inefficiencies.  In  spite  of  the  tremendous  growth  and  success 
of  the  DEA  methodology,  there  has  been  httle  research  into  methodologies  for  ordering 
the  DMUs  according  to  the  efficiency  of  their  production  and  management  practices. 
The  inability  to  rank  DMUs  by  the  comparative  degree  of  efficiency  or  inefficiency  Umits 
the  potential  for  DEA  to  fuUy  characterize  successful  and  unsuccessful  management 
practices. 

Although  inefficient  DMUs  receive  a  DEA  score  which  reflects  the  degree  of 
inefficiency,  a  direct  comparison  of  DMUs  is  problematic  unless  both  DMUs  have  the 
same  efficient  reference  set  (ERS).  To  illustrate.  Figure  3.1  depicts  a  unit  isoquant  plot 
of  a  set  of  firms  with  one  output  and  two  inputs.  Since  each  point  represents  a  firm 
and  all  firms  produce  the  same  level  of  output,  the  more  efficient  units  are  closer  to  the 
origin.  The  efficient  frontier  is  formed  by  DMUs  A,  B,  C  and  D.  DMUs  E,  F,  and  G  are 
all  inefficient.  The  efficiency  of  each  point  is  determined  by  a  ratio  whose  denominator 
is  the  length  of  the  fine  segment  from  the  point  to  the  origin  and  the  munerator  is  the 
length  of  the  fine  segment  from  the  origin  to  the  efficient  boundary.  The  line  connecting 
an  inefficient  DMU’s  point  and  the  origin  will  intersect  one  of  the  fine  segments  forming 
the  efficient  frontier.  The  end  points  of  this  line  segment,  composed  of  efficient  DMU 
points,  form  the  efficient  reference  set  for  the  inefficient  DMU. 
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Acx:ording  to  Chames  and  Cooper  [32],  to  compare  DMUs  with  difierent  efficient 
reference  sets  would  require  assumptions  of  the  weighting  (or  pricing)  scheme  used  by 
DEA.  But  it  is  precisely  this  lack  of  restrictions  on  the  weighting  scheme  that  makes 
the  DEA  methodology  so  attractive.  Consequently,  in  general  such  assumptions  are 
undesirable.  In  our  example,  DMUf  can  be  compared  to  DMUg  because  they  share  the 
same  ERS  consisting  of  DMUc  and  DMUj).  In  this  case  DMUg  with  an  efficiency  score 
of  0.800  is  more  efficient  than  DMU p  with  an  efficiency  score  of  0.774.  However,  neither 
of  these  DMUs  should  be  compared  with  DMU^,  with  an  efficiency  score  of  0.733,  which 
has  a  different  ERS  composed  of  DMUs  B  and  C.  Hence,  for  inefficient  DMUs,  a  new 
approach  is  necessary  to  further  discriminate  and  allow  comparisons  across  all  inefficient 
DMUs. 


Figure  3.1.  Normalized  data  isoquant  plot 

For  efficient  DMUs,  the  relative  importance  of  each  DMU  is  difficult  to  discern. 
Because  each  such  unit  has  a  DEA  score  of  1,  there  exists  no  variation  in  scores  to 
determine  a  relative  value.  Chames  and  Cooper  [32]  suggested  a  tool  which  they  called 
the  envelopment  map  to  characterize  the  magnitude  of  the  importance  of  each  efficient 
DMU.  This  method  consists  of  covmting  the  number  of  times  each  efficient  DMU  ap>- 
peared  as  a  member  of  an  ERS.  Those  DMUs  occurring  more  often  would  be  considered 
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more  “consistently  efficient.”  However,  there  axe  at  least  two  problems  with  this  measure. 
First,  to  correctly  count  all  occurrences  of  an  efficient  DMU  in  an  ERS,  all  alternate 
optimal  solutions  of  the  DEA  Xi^odels  would  need  to  be  identified.  This  can  be  compu¬ 
tationally  expensive  and  difficult  to  track.  Secondly,  this  counting  offers  only  a  limited 
amount  of  useful  information.  An  efficient  DMU  that  occurs  often  in  an  ERS  merely 
indicates  that  the  DMU  helps  define  part  of  the  efficient  surface  which  overshadows  a 
high  concentration  of  inefficient  DMUs.  Firms  utilizing  new  production  techniques  may 
be  extremely  efficient,  yet  operate  far  from  the  “crowd”  of  other  DMUs.  As  a  result, 
these  efficient  firms  do  not  occur  often  in  the  efficient  reference  sets  of  the  inefficient 
units.  Consequently,  these  maverick  DMUs  may  not  be  deemed  as  important  as  they 
should  be. 

To  discriminate  between  and  identify  the  successful  and  unsuccessful  production 
practices  of  the  DMUs,  a  new  procedure  is  necessary  that  provides  a  more  detailed 
classification  of  DMUs  than  the  ordinary  DEA  efficiency  scores  offer.  This  procedure 
should  result  in  a  rank  ordering  of  DMUs  which  serves  as  a  proxy  measure  for  managerial 
efficiency.  In  this  way,  managers  and  analysts  will  have  a  useful  tool  to  identify  manage¬ 
ment  practices  that  both  accentuate  and  detract  from  productive  efficiency  by  observing 
the  practices  of  the  higher  and  lower  ranked  DMUs.  Additionally,  experts  may  want  to 
state,  a  priori  to  the  DEA  analysis,  what  they  beUeve  to  be  the  most  efficient  firms  in 
the  industry.  The  rank  ordering  procedure  can  then  be  used  to  determine  how  the  DEA 
results  compare  with  the  experts’  opinions.  As  in  standard  DEA  analysis,  the  ordering 
should  allow  either  constant  or  variable  retums-to-scale  envelopments.  The  remainder  of 
this  chapter  presents  just  such  a  rank  ordering  procedure  which  is  easy  to  implement  and 
meets  all  of  the  above  criteria. 
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3.1.  Need  for  a  Ranking  Procedure 


Until  recently,  most  modem  writing  on  production  theory  assumed  that  aU  pro¬ 
ducers  were  efficient.  The  premise  was  that  in  a  competitive  market  place  the  inefficient 
producers  would  realize  the  direct  costs  and  indirect  opp>ortunity  costs  of  continued  pro¬ 
duction  and  would  leave  the  market  to  piusue  more  profitable  adventures.  However, 
economic  analysts  have  come  to  accept  that  inefficient  production  occurs  in  the  market 
place  and  its  causes  vary.  Inefficient  production  can  occm  because  information  on  the 
most  productive  methods  is  neither  perfect  nor  free.  As  a  result,  some  firms  may  be 
slower  to  respond  to  changing  market  conditions  than  others.  Along  with  imperfect 
information,  market  uncertainty  influences  the  production  process.  The  organization’s 
(or  the  manager’s)  position  towards  risk  will  dictate  the  rapidity  with  which  the  firm 
will  respond  to  change  in  the  shadow  of  this  uncertainty.  Additionally,  because  perfect 
competition  is  rarely  (if  ever)  seen,  regulations,  and  other  exogenous  constraints  may 
induce  inefficiencies  in  the  production  process.  Because  of  the  social  costs  associated 
with  inefficient  conversion  of  input  resources  to  output  goods,  there  has  been  a  growing 
interest  in  identifying  and  quantifying  the  inefficient  processes.  Data  envelopment  anal¬ 
ysis  has  proven  useful  in  measuring  various  types  of  the  production  inefficiencies  which 
may  be  attributed  to  inefficient  managerial  practices. 

The  original  DEIA  models  focused  on  identifying  technical  inefficiency  or  scale 
inefficiency  [11].  Fare,  Grosskopf,  and  Lovell  [53],  relaxed  the  usual  DBA  assumption 
of  strong  disposability  of  inputs  and  outputs  to  further  identify  inefficiencies  due  to 
congestion  of  resources.^  In  addition,  Fare,  Grosskopf,  and  Lovell,  following  the  lead 
of  Farrell  [51],  introduced  prices  of  inputs  and  outputs  to  identify  allocative  inefficiencies. 

^  Congestion  of  inputs  impUes  that  as  at  least  one  input  increases,  at  least  one 
output  is  reduced.  That  is,  there  is  not  a  positive  correlation  between  all  outputs  and 
inputs. 
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A  firm  demonstrates  allocative  inefficiency  when  it  departs  from  its  predefined  goal  such 
as  maximizing  profits  or  minimizing  costs. 

Even  though  technical,  scale,  and  allocative  inefficiencies  can  be  measured,  little 
has  been  written  to  formulate  a  means  of  ranking  the  DMUs  based  on  the  types  of  inef¬ 
ficiencies  they  demonstrate.  As  a  result,  even  though  the  inefficiencies  can  be  identified, 
they  have  not  fully  been  related  to  various  aspects  of  producer  behavior.  With  a  ranking 
system,  the  DMUs  exhibiting  the  best  production  processes,  in  terms  of  efficiency,  could 
be  compared  to  those  characterized  by  the  worst  production  techniques.  The  manage¬ 
ment  practices  of  the  best  producing  DMUs  could  then  be  compared  to  the  worst  DMUs 
in  order  to  identify  the  underlying  managerial  inefficiencies.  Once  identified,  the  less 
efficient  firms  could  adopt  the  practices  of  the  best  firms  to  improve  the  productivity  of 
their  operations. 

The  purpose  of  this  study  is  to  present  a  new  approach  to  rank  order  or  stratify 
the  DMUs  to  more  clearly  relate  the  efficiency  (or  inefficiency)  of  a  given  DMU  to  all 
others  in  the  set.  The  intent  is  not  to  suggest  this  approach  as  the  only  vahd  rank 
ordering  scheme.  Indeed,  any  set  of  items  cem  be  ranked  by  any  subjective  meems. 
However,  the  piupose  is  to  present  a  methodology,  with  theoretical  underpinnings,  which 
can  result  in  a  meaningful  ordering  system.  It  is  hoped  that  this  methodology  may 
stimulate  research  into  other  possible  ranking  procedures  so  that  the  richness  of  the  DEA 
methodology  can  be  more  fully  utfiized. 

3.1.1.  Basis  of  the  Ranking  Procedme 

Data  envelopment  analysis  defines  efficiency  based  on  empirical  observations  of 
each  evaluation  unit’s  behavior.  Each  DMU  consumes  multiple  inputs  in  order  to  pro¬ 
duce  one  or  more  outputs.  The  implicit  assumption  in  DEA  is  that  the  DMUs  transform 
the  inputs  into  outputs  by  means  of  a  well-behaved  production  technology.  A  feasible 
input-output  vector  for  a  given  DMU  is  technically  efficient  (Pareto-Koopmans  efficient) 
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if  it  is  technologically  impossible  to  increase  any  output  and/or  to  reduce  any  input 
without  simultaneously  reducing  at  least  one  other  output  and/or  increasing  at  least  one 
other  input.  The  empirical  models  of  DEA  use  observed  data  to  identify  the  DMUs  which 
form  the  Pareto-Koopmans  efficient  surface,  and  hence,  are  best  at  using  the  current 
technology  to  convert  the  input  resources  to  output  goods.  The  level  of  technological 
achievement  revealed  by  the  efficient  surface  will  be  highly  dependent  upon  the  choice 
of  DMUs  used  in  the  study  since  the  methodology  only  measures  relative  (not  absolute) 
efficiency. 

The  ranking  method  presented  in  this  study  separates  DMUs  into  groups  based 
on  the  level  of  technological  achievement  which  they  demonstrate.  It  will  be  shown  that 
this  aggregation  reveals  additional  aspects  of  inefficiencies  not  available  with  traditional 
DEA  measmes.  For  example,  a  DMU  that  appears  to  be  very  inefficient  by  the  standard 
DEA  measures,  may  rank  well,  compared  to  other  firms,  when  viewed  in  terms  of  the 
DMU’s  ability  to  employ  the  most  recent  technological  advances.  Once  the  DMUs  are 
separated  into  these  achievement  levels,  a  procedure  will  be  presented  to  rank  the  DMUs 
within  each  level.  The  ranking  within  a  level  will  be  determined  by  the  contribution  the 
DMU  makes  to  defining  the  shape  of  the  efficient  surface  of  that  level. 

An  attractive  feature  of  the  proposed  ranking  procedure  is  that  it  can  be  used 
across  the  many  formulations  of  the  DEA  models.  In  this  paper,  the  ranking  procedure 
is  introduced  and  apphed  to  both  the  input-  and  output-oriented  BCC  (variable-retums- 
to-scale)  models.  The  input-  and  output-oriented  models  achieve  identical  stratification 
of  DMUs  into  tiers  of  common  technological  achievement  levels.  However,  the  ranking 
within  each  tier  will  difier  according  to  the  orientation  used.  Next,  the  ranking  procedure 
is  applied  to  the  OCR  (constant  retmus  to  scale)  model.  In  this  case,  the  input-  and 
output-oriented  schemes  result  in  identical  rankings.  The  CCR  model  adds  interesting 
interpretations  of  the  most  productive  scale  size  to  achieve  each  technological  level. 
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3.2.  Tiered  DEA 


At  the  heart  of  data  envelopment  analysis  is  the  separation  of  evaluation  units 
into  (relatively)  efficient  and  inefficient  sets.  The  models’  objective  function  values,  0  or 
2,  have  been  used  as  metrics  for  the  degree  of  inefficiency  for  comparative  and  predictive 
purposes  [21,22].  Since  these  values  may  be  incompatible,  from  an  economic  point-of- 
view,  we  present  a  different  approach  to  comparing  DMUs  that  has  significant  appeal, 
from  both  intuitive  and  economic-theoretic  standpoints. 

3.2.1.  Tiering  Algorithm 

The  following  tiered  DEA  (TDEA)  procedure  stratifies  decision  units  into  tiers, 
or  layers,  of  comparable  productive  efficiency,  as  measured  by  any  standard  DEA  model. 

Procedure  TDEA 

1.  Initialize:  t  *—  1,  D[ij  *—  D. 

2.  While  D|t|  ^  0  do: 

a.  Apply  a  DEA  model  to  the  DMUs  in  set  Dj^j  to  identify  EJ'^j . 

I[i]  =  D[f]  -  Eft]- 

c.  t  ^t  +  1. 

d-  D[tj  =  If^j. 

where  t  is  a  tier  index  and  Ef^j  and  I^j  are  the  sets  of  efficient  and  inefficient  DMUs  on 
tier  t,  respectively,  relative  to  set  D[tj. 

The  TDEA  procedure  begins  with  a  traditional  data  envelopment  analysis,  then 
progressively  strips  away  production  siufaces,  revealing  a  series  of  frontiers  of  decreasing 
productivity.  Specifically,  at  tier  1  all  of  the  DMUs  in  the  data  set  are  analyzed  using 
a  standard  DEIA  model,  thus  separating  them  into  efficient  and  inefficient  sets.  The 
efficient  units  are  then  assigned  to  the  current  tier  and  the  inefficient  ones  become  the 
data  set  of  interest  for  the  next  higher  tier;  this  process  is  applied  recursively  until  all 
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DM  Us  are  assigned  to  a  tier.  In  this  way,  the  tier  levels  represent  successive  layers 
of  relatively  efficient  production  surfaces  where  the  DMUs  at  any  given  tier  are  less 
productively  efficient  than  those  of  “outer”  (lower-numbered)  tiers  and  more  efficient 
than  DMUs  at  “inner”  (higher-numbered)  tiers. 

An  example  application  of  the  TDEA  procedvue  can  be  seen  in  Figure  3.2  where 
10  DMUs,  each  with  1  output  and  2  inputs,  are  plotted.  All  DMUs  have  the  same  output 
level  so  only  the  inputs  are  shown.  DMUs  A,  B,  and  C  are  DEA  efficient  since  the  line 
segments  joining  these  DMUs  envelop  the  other  DMUs  from  below.  If  firms  A,  B,  and  C 
were  removed  from  the  data  set,  a  new  efficient  frontier  would  be  formed  by  DMUs  D,  E, 
F,  and  G.  The  TDEA  procediue  reveals  that  the  data  contains  three  production  surface 
layers. 


^2 
4 
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1 
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Figure  3.2.  Example  tiered  results  -  two  inputs,  one  output 

Figure  3.3  shows  10  DMUs  each  with  1  input  and  2  outputs.  In  this  case,  all 
DMUs  use  the  same  level  of  input  so  only  the  outputs  are  shown.  Again,  DMUs  A, 
B,  and  C  are  efficient  since  they  form  a  boundary  that  envelops  the  other  DMUs  from 
above.  TDEA  reveals  three  production  surface  layers. 
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Figure  3.3.  Eixample  tiered  results  -  one  input,  two  outputs 
3.2.2.  Example  TDEJA  Applications 

The  TDEA  procedure  was  applied  to  three  8,000-DMU  data  sets,  one  from  in¬ 
dustry  and  the  others  randomly  generated.  The  “Banking”  data  represents  a  selection 
of  banks  from  the  Federal  Reserve  Bank’s  Southwest  district,  with  6  input  and  3  output 
values,  as  described  in  {21].  A  set  of  “Cobb-Douglas”  data  with  5  input  and  4  output  val¬ 
ues  per  observation  was  created  with  DEA-GEN  using  constant  returns  to  scale  and  the 
parameter  set  a  =  (.20,  .20,  .20,  .20,  .20).  (See  Appendix  B  for  the  generation  procedure.) 
Also,  a  “Multi-Normal”  data  set,  with  4  inputs  and  3  outputs  and  the  variance-covariance 
matrix  given  in  Appendix  A,  was  generated  using  the  DRNMVN  routine  from  the  IMSL 
library. 

For  this  test  bed,  the  TDEA  procedure  used  the  CCRj  model  to  assign  DMUs 
to  layers.  Table  3.1  reports,  for  the  first  20  tiers,  the  number  of  DMUs  assigned  to 
each  layer  (“|E|’jj|”)  and  their  maximum,  minimum,  and  mean  tier  1  efficiency  scores 
(0).  Note  that,  in  each  case,  that  the  mean  and  maximiim  efficiencies  drop  with  each 
successive  interior  layer,  as  might  be  expected.  However,  the  TDEA  provides  insight 
that  DEA  cannot  provide.  As  noted  by  the  minimum  values,  some  DMUs  possess  a 
low  tier  1  efficiency  score,  yet  fall  on  an  outer  tier,  indicating  an  efficient  iise  of  current 
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Table  3.1.—  Large  data  set  tiered  DEA  results 
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technology.  Conversely,  the  maximum  values  indicate  that  while  other  DMUs  may  seem 
fairly  efficient,  from  a  ^-standpoint,  they  fall  on  an  inner  tier  because  of  less  productive 
uses  of  current  technology.  It  is  precisely  this  behavior  that  should  prove  useful  in 
identifying  a  DMU’s  true  level  of  efficiency. 

Of  interest  also  are  the  differences  between  these  three  problems.  The  frequency 
counts  reveal  that  the  banking  data  has  only  18  tiers,  while  the  Cobb-Douglas  data  still 
has  3,286  of  its  8,000  points  still  un-stratified  after  20  tiers.  The  multi-normal  problem 
has  more  populous  tiers  than  the  Cobb-Douglas,  but  less  than  most  of  the  bank’s.  The 
banking  and  multi-normal  data  sets  have  tier- 1-efficient  DMUs  on  interior  layers,  but 
Cobb-Douglas  does  not.  The  banking  and  Cobb-Douglas  data  have  much  larger  mean- 
efficiency  drops  between  tiers  1  and  2,  relative  to  the  multi-normal.  Each  problem  seems 
to  have  a  very  different  structure  from  the  others,  as  uncovered  by  the  TDEA  process. 

3.2.3.  Tiered  DEA  and  the  BCC  Model 

The  examples  of  the  figures  above  illustrate  the  result  of  applying  the  tiering 
procedure  to  the  BCC^  and  BCC°  models.  These  models,  used  to  identify  the  Pareto- 
Koopmans  efficient  production  surface,  with  no  restrictions  on  returns  to  scale,  were  first 
proposed  by  Banker,  Chames,  and  Cooper  (11).  The  BCCP^  model  can  be  written  as: 


(BCCj)  min  0  -  £(ls*  +  ls°)  (3.1) 

s.t.  YA  -  s°  =  Yj  (3.2) 

eXj  -  XA  -  «*  =  0  (3.3) 

lA  =  1  (3.4) 

\8\8°>0  (3.5) 

d  free  (3.6) 
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In  Figure  3.2  and  Figure  3.3,  the  outer  layers  designated  by  DMUyt,  DMUb, 
and  DMUc  represent  the  efficient  set  which  demonstrate  best  practice  in  the  production 
process.  However,  this  empirical  production  surface  is  dependent  on  the  data  observed. 

Suppose  the  original  set  had  not  included  B.  The  models  would  have  identified  a 
different,  yet  legitimate,  production  surface  revealing  the  current  technology  consisting  of 
DMUs  A,  E,  F,  and  C.  If  data  on  B  later  became  available  and  was  added  to  the  set  of 
observations,  it  would  markedly  alter  the  shape  of  the  efficient  production  surface.  B  now 
reveals  a  new  production  process  with  a  corresponding  level  of  technological  achievement 
that  was  previously  imseen.  It  is  precisely  this  realization  that  motivates  the  tiered 
ranking  procedure. 

In  DEA,  all  of  the  efficient  DMUs  share  a  common  characteristic:  they  all  demon¬ 
strate  an  ability  to  make  the  best  use  of  the  current  technology  to  conduct  their  produc¬ 
tion  process  (i.e.,  they  demonstrate  best  practice  behavior).  Once  these  DMUs  have 
been  identified,  they  can  be  temporarily  removed,  and  the  remaining  DMUs  form  a  new, 
valid  DEA  data  set.  When  the  BCC^j  model  is  appfied  to  this  new  data  set  a  new 
efficient  production  surface  is  revealed.  Had  the  data  for  DMUs  of  the  outer  tier  not 
been  available  originally,  this  new  production  smfface  would  legitimately  characterize  the 
best  technological  achievement  level  observed.  Consequently,  the  DMUs  comprising  this 
new  efficient  surface  share  a  common  level  of  success  of  utilizing  the  currently  revealed 
technology.  Repeating  this  process  groups  the  DMUs  according  to  common  achievement 
levels.  DMUs  on  outer  tiers  reveal  a  technological  advance  not  realized  by  DMUs  on 
inner  tiers. 

Of  primary  significance  is  that  the  tiering  procedure  provides  greater  discrim¬ 
inatory  power  in  determining  managerial  efficiency  than  previous  DEA  measures.  To 
appreciate  the  importance  of  the  new  measure,  an  understanding  of  what  causes  a  DMU 
to  be  inefficient  is  helpful.  By  stratifying  across  different  tiers,  the  new  measure  provides 
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a  more  complete  description  of  the  DMU’s  managerial  efficiency  relative  to  its  contem¬ 
poraries.  One  reason  a  DMU  may  be  characterized  as  relatively  inefficient  in  DEA  is 
that  it  is  dominated  by  a  few  highly  efficient  DMUs.  A  high  concentration  of  other 
DMUs,  with  similar,  but  superior,  management  practices  can  cause  the  inferior  DMUs 
to  attain  a  low  tier  assignment.  Figure  3.4  depicts  a  set  of  DMUs  with  one  output  and 
one  input.  The  tiering  procedure  produces  three  layers.  In  this  case,  J  is  enveloped 
by  a  concentration  of  DMUs  using  nearly  the  same  level  of  inputs  to  produce  a  similar 
level  of  outputs.  However,  J  consistently  imder-achieves  compared  to  the  other  DMUs, 
consequently,  it  would  receive  a  relatively  low  ranking.  Even  though  the  original  DEA 
scores  for  DMUs  F,  G,  and  J  are  similar,  I'DEA  further  discriminates  J  as  less  productive 
given  the  available  technology  than  DMUs  F  and  G. 


Y 
4 

3 

2 

1 

1  2  3  4  5  X 

Figure  3.4.  Example  1  tiered  results  -  one  input,  one  output 

A  second  reason  an  inefficient  DMU  may  result  in  a  low  efficiency  score  is  that  it 
operates  in  a  production  process  “away  from  the  crowd.”  These  maverick  DMUs  may  be 
leaders  in  the  introduction  of  new  production  technologies  or  management  methods  into 
the  market  place.  The  transition  to  these  new  procedures  is  penalized  under  traditional 
DEA  anal)rsis  because  it  appears  that  the  DMUs  introducing  the  new  inputs  consume 
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more  than  the  peers.  The  dominant  DMU  of  this  shift  may  be  efficient,  but  the  other 
DMUs  can  appear  to  be  very  inefficient.  This  situation  can  be  seen  with  H.  Notice, 
in  this  case  H  has  a  technical  efficiency  score  seen  as  the  ratio  77^-  J  has  a  higher 
technical  efficiency  score  seen  as  the  ratio  but  is  ranked  lower  than  H  in  terms  of 
tier  assignment.  H  may  represent  a  risk-taker  that  shows  a  short-term  reduction  in  its 
DEA  efficiency  score,  to  take  advantage  of  new  technology,  by  introducing  new  inputs. 
However,  even  with  a  short  term  loss  of  efficiency,  the  DMU  could  see  a  rise  in  its  rank 
ordering  by  moving  to  a  higher  tier  level.  Once  the  new  technology  is  fully  integrated  into 
the  production  process,  the  DMU  may  witness  substantial  increases  in  its  DEA  efficiency 
score.  IVaditional  DEA  analysis  would  penalize  the  management  decision,  in  the  short 
run,  to  introduce  the  new  technology  into  the  production  process.  Consequently,  the 
decision  to  incur  possible  short  term  losses  to  achieve  long  term  gains  would  appear 
unfavorable  in  a  traditional  DEA  analysis.  However,  the  stratified  ranking  of  DMUs 
reveals  the  success  of  this  management  decision. 

The  above  scenario  can  occur  often  in  a  firee  competitive  market.  Under  dynamic 
conditions,  the  firms  in  the  competitive  market  must  adapt  to  maintain  market  share.  A 
DMU  that  adopts  a  management  style  and  production  process  similar  to  other  DMUs, 
but  consistently  under-performs,  may  result  in  a  seemingly  high  relative  efficiency  score, 
but  with  a  low  rank  ordering.  The  inability  of  this  competitive  firm  to  find  its  niche  and 
distinguish  itself  firom  the  “competition”  may  result  in  its  failure  in  the  market  place. 
Current  DEA  methodologies  are  inadequate  to  reveal  such  conditions;  TDEA  offers  an 
attractive  means  to  discriminate  between  these  managerial  behaviors. 

DMUs  with  low  efficiency  scores  but  with  relatively  high  rank  order  are  not 
restricted  to  DMUs  at  “fringe”  production  levels.  Data  outliers  can  strongly  affect  the 
shape  of  the  efficient  surface.  Figure  3.5  shows  the  effect  that  outher  B  has  on  the 
production  siirface.  The  outlier  significantly  distorts  the  surface  making  F  seem  relatively 
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inefficient.  Had  B  not  been  in  the  data  set,  F  would  be  efficient.  In  spite  of  F’s  low 
efficiency  score,  TDEIA.  ranks  F  relatively  high.  Consequently,  the  performance  of  F  may 
not  be  as  poor  as  indicated  by  the  DEA  score. 

Y 
4 

3 

2 
1 

1  2  3  4  5  X 

Figure  3.5.  Example  2  tiered  resrilts  -  one  input,  one  output 

A  key  advantage  of  stratifying  DMUs  into  tiers  is  that  it  allows  the  DEA  method¬ 
ology  to  more  closely  describe  true  managerial  efficiency  that  may  be  masked  by  tradi¬ 
tional  DEA  analysis.  As  a  result,  managers  of  inefficient  DMUs  have  increased  flexibility 
in  improving  production  operations.  The  mamiger’s  long-term  goal  may  be  to  achieve 
efficiency.  The  DMU  can  strive  to  8Ux:omplish  this  by  improving  the  short  nm  rank 
ordering  without  a  myopic  focus  on  its  DEA  efficiency  score. 

3.2.4.  Tiered  DEA  and  the  CCR  Model 

The  ranking  procedure  can  also  be  applied  to  the  CCR  models.  For  these  models, 
which  assume  constant  retmus  to  scale,  the  efficient  DMUs  not  only  are  operating 
most  efficiently  (with  the  greatest  level  of  technological  achievement)  but  they  are  also 
operating  at  the  most  productive  scale  size  (MPSS).  For  the  single  input  and  single 
output  model,  the  MPSS  is  determined  by  the  DMUs  yielding  the  highest  ratio  of 
quantity  output  to  quantity  input.  In  economic  terms,  this  equates  to  the  DMUs  yielding 


the  highest  average  product.  Banker  |10j  demonstrated  that  the  CCR  model  revealed  the 
efficient  DMUs  operating  at  the  MPSS  in  multi-dimensional  processes. 

The  MPSS  units  which  form  the  frontier  for  the  CCR  models  are  dependent  on 
the  observed  set  of  data.  In  a  traditional  DBA  analysis,  the  CCR  scores  for  the  inefficient 
DMUs  reflect  both  technical  and  scale  inefficiencies.  To  separate  the  technical  from  the 
scale  inefficiency,  the  BCC  model  must  also  be  run.  The  tiered  procedure,  when  apphed 
to  the  CCR  model,  presents  a  different  picture  of  scale  inefficiency.  As  each  outer  tier 
is  removed  from  the  data  set,  the  set  of  DMUs  which  composes  the  most  productive 
scale  size  changes.  As  a  result,  the  values  of  the  scale  inefficiencies  for  the  other  DMUs 
also  change.  Therefore  traditional  DEIA  measures  may  overstate  the  amoimt  of  scale 
inefficiency  that  a  DMU  demonstrates. 


Figure  3.6.  Example  CCR  tiered  results  -  one  input,  one  output 


Figure  3.6  shows  two  tiers  of  scale-efficient  DMUs  for  the  same  set  of  data  used  in 
Figure  3.4,  where  DMUs  A,  B,  C,  and  D  were  shown  to  be  technically  efficient.  However, 
Fig  3.6  indicates  B  and  C  are  scale  efficient  while  A  and  D  are  both  scale  inefficient. 
Because  D  falls  far  from  the  most  productive  scale  boundary,  the  CCi?j  DBA  measure 
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would  indicate  it  is  more  scale  inefficient  than  A.  Yet,  with  the  tiering  procedure,  both 
become  scale  efficient  at  a  common  level.  Ck>nsequently,  the  two  DMUs  may  not  be  as 
different,  in  terms  of  scale,  as  first  suggested  by  the  DEA  scores. 

3.3.  Ranking  Within  Tiers 

The  TDEIA.  approach  extends  traditional  DEA  to  allow  for  a  stratification  of 
DMUs  reflecting  different  productive  efficiency  layers.  However,  a  ranking  system  within 
each  layer  is  still  needed.  This  section  describes  a  new  procedure  that  can  provide  a 
useful  ranking. 

Each  tier  defines  a  set  of  DMUs  that  forms  a  production  surface  (layer).  To 
determine  the  relative  rank  ordering  among  the  DMUs  at  each  tier,  one  could  measure 
the  contribution  of  the  DMU  to  the  shape  of  its  layer.  DMUs  that  significantly  distort 
the  production  surface  layer  on  which  they  are  assigned  play  a  more  prominent  role  in 
determining  the  shape  of  the  production  surface.  Consequently,  if  these  distortions  can 
be  measured,  the  DMUs  could  be  ranked  by  the  degree  of  distortion  they  contribute  to 
the  production  layer. 

The  following  model  describes  an  extremal  DEA  (EDEA)  approach  for  the  CCR^j 
model.  However,  the  formulation  also  appUes  to  the  output-oriented  models  and  to 
either  constant-  or  variable-retums-to-scale  formulations.  With  this  methodology,  the 
DEA  formulation  is  modified  slightly  to  measure  how  far  a  DMU  is  from  the  resulting 
production  siirface  when  the  DMU  itself  is  removed  from  the  data  set  of  interest.  The 
EDEIA  procedure  can  be  described  as  follows.  Let  X  be  an  (m  x  n)  matrix  and  Y  be 
an  (s  X  n)  matrix  of  the  observed  input  and  output  values,  respectively,  of  all  the  DMUs 
of  interest.  Select  a  DMU  to  be  observed,  in  this  case  DMUo.  Let  XW  and  be  X 

and  Y,  respectively,  with  the  observations  of  DMU®  removed.  Then  the  extremal  DEIA 
measme  6  is  computed  by  the  following  model. 
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(EDEA  Model) 


min  6 


(3.7) 

s.t.  Y^A  -  «“  =  n  (3.8) 

0X-XMA-a*  =  O  (3.9) 

A,  a®  >  0  (3.10) 

6  free  (3.11) 


In  contrast  to  a  traditional  DEA  analysis,  6  can  now  be  greater  than  one.  For 
(?  >  1.0  the  DMU  will  be  efficient  and  6  measures  the  allowable  proportional  increase 
in  inputs  for  the  DMU  to  remain  efficient.  If  ^  =  1,  the  corresponding  DMU  may  be 
weakly  efficient  from  a  DEIA  standpoint.  However,  for  the  inefficient  DMUs,  the  EDEIA 
scores  will  be  identical  to  traditional  DEA  scores,  (e.g.,  0  =  B),  since  the  inefficient  DMU 
itself  will  never  be  a  member  of  an  optimal  basis  (this  can  be  shown  in  the  same  manner 
as  Theorem  1  of  Chapter  H).  Consequently,  EDEA  has  an  advantage  over  traditional 
DEA  models  in  that  it  provides  greater  meaning  to  the  scores  for  the  efficient  DMUs  by 
allowing  additional  variabUity  in  the  efficient  score  values. 

X2 

4 

3 

2 

1 

1  2  3  4  5  Xj 

Figure  3.7.  Extremal  DEA  example 
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Figure  3.7  illustrates  the  EDEA  procedure  for  B  when  appUed  to  the  data  in 
Table  3.2.  Here,  B  is  projected  onto  the  “new”  efficient  surface  using  EDEA.  The 
resultant  objective  value,  0*  =  measures  how  “far”  B  is  from  the  efficient  surface. 
Viewing  the  problem  from  a  different  perspective,  6*  reflects  the  degree  to  which  B  would 
contribute  to  the  shape  of  the  new  efficiency  surface  if  it  was  added  to  the  data  set. 

DMUs  that  cause  significant  and  important  distortions  of  the  efficiency  surface 
will  result  in  a  high  EDEA  objective  value.  Those  DMUs  that  have  httle  influence  on 
the  shape  of  the  production  surface  will  have  objective  values  close  to  1.  Consequently, 
the  DMUs  can  be  ranked  by  order  of  influence  at  each  tier  level  based  on  the  EDEIA 
scores. 

An  overall  ranking  of  all  DMUs  can  be  achieved  by:  (1)  using  TDEA  to  to  stratify 
all  DMUs  into  tiers,  (2)  applying  EDEIA  to  each  tier  level,  (3)  ranking  each  tier’s  DMUs 
by  dy  (4)  then  finding  each  unit’s  overall  rank  by  ranking  those  DMUs  on  outer  tiers  as 
more  important  than  those  on  inner  tiers.  The  following  section  presents  an  illustrative 
example  of  this  procedure  and  provides  some  numerical  interpretations. 

3.4.  Example  of  Tiered  DEA 

The  illustration  shown  in  Figure  3.2  is  based  on  the  data  in  Table  3.2.  As  illus¬ 
trated  in  Figure  3.2,  the  TDEA  approach  yields  three  production  surfaces.  Those  DMUs 
on  tier  1  are  the  same  efficient  units  found  in  a  standard  envelopment  analysis.  The 
DMUs  of  succeeding  inner  tiers  comp<^  less  efficient  production  surface  layers.  Notice 
though,  the  inefficient  DMUs  can  now  be  further  distinguished  by  the  production  surface 
on  which  they  fall.  DMUs  on  outer  tier  levels  represent  more  successful  production 
processes.  Consequently,  the  DMUs  on  outer  tiers  can  be  ranked  more  efficient  than 
DMUs  on  inner  tiers.  As  will  be  demonstrated,  this  does  not  necessarily  coincide  with 
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Table  3.2.  -  Tiered  DEA  example  data 


DMU 

A 

B 

C 

D 

E 

F 

G 

H 

J 

K 

y 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

Xi 

0.50 

1.00 

5.00 

1.00 

1.50 

2.00 

4.50 

2.00 

3.00 

4.50 

X3 

4.00 

1.50 

1.00 

3.50 

2.50 

2.00 

1.50 

3.50 

2.50 

1.80 

Table  3.3.  —  Results  of  the  tiered  rank  ordering  procedure 


DMU 

A 

B 

C 

D 

E 

F 

G 

H 

J 

K 

DEA 

1.00 

1.00 

1.00 

.765 

.650 

.722 

.788 

.482 

.565 

.688 

TDEA 

1 

1 

1 

2 

2 

2 

2 

3 

3 

3 

TEDEAl 

2.00 

1.60 

1.50 

.765 

.650 

.722 

.788 

.482 

.565 

.688 

TEDEA2 

— 

— 

— 

1.50 

1.05 

1.13 

1.33 

.733 

.774 

.889 

TEDEA3 

— 

— 

— 

— 

— 

— 

— 

1.50 

1.13 

1.39 

RANKl 

1 

2 

3 

— 

— 

— 

— 

— 

— 

— 

RANK2 

— 

— 

— 

1 

4 

3 

2 

— 

— 

— 

RANK3 

— 

— 

— 

— 

— 

— 

— 

1 

3 

2 

RANK 

1 

2 

3 

4 

7 

6 

5 

8 

10 

9 

DEARANK 

1 

1 

1 

5 

8 

6 

4 

10 

9 

7 

the  traditional  DEA  score.  The  results  of  the  TDEA  coupled  with  the  EDEA  are  shown 
in  Table  3.3  and  labeled  as  TEDEIAl  through  TEDEA3. 

The  DEA  row  lists  the  CCR*  6  scores  for  each  unit  in  the  data  set.  The  TDEIA 
row  indicates  the  tier  level  to  which  each  DMU  is  assigned  as  a  result  of  the  TDEA 
procedure.  TEDEAf  rows  give  the  EDEA  §  values  for  all  DMUs  on  tier  t  or  higher. 
DMUs  in  tier  t  are  ranked  within  tier  in  the  RANKf  rows.  The  overall  rank  for  the  entire 
set  of  DMUs  is  given  in  the  RANK  row.  This  can  be  compared  to  the  ranking  the  DMUs 
would  have  been  given,  listed  in  DEARANK,  had  they  been  ordered  by  the  CCR*  6 
value. 

Some  important  observations  can  be  made  concerning  these  results.  Notice,  the 
DEA  and  the  TEDEAl  results  are  identical  for  the  inefficient  DMUs.  Likewise,  the 
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efficient  DMUs  are  appropriately  identified  with  values  greater  or  equal  to  1.  Notice  that 
the  higher  inefficiency  scores  do  not  necessarily  indicate  on  what  tier  level  a  DMU  may 
fall.  For  example,  K  has  a  higher  6  than  E  but  falls  on  a  lower  tier  level.  By  observing 
the  resixlts  at  TEDEA3  one  notices  that  the  ranking  of  H,  J,  and  K  does  not  correspond 
with  the  ranking  the  DMUs  would  receive  if  the  DEA  efficiency  scores  were  used.  In 
fact,  H  has  the  lowest  DEA  efficiency  score,  but  the  highest  rank  of  tier  3.  Figure  3.8 
illustrates  why  this  is  so.  DMU  J  does  not  significantly  distort  the  efficiency  surface  that 
exists  when  it  is  not  present.  This  is  not  true  for  H  which  significantly  <iistorts  the  shape 
of  the  production  surface;  DMU  H  is  more  influential  than  DMU  J  and  thus  is  ranked 
higher. 


Figure  3.8.  TEDEA  examples 


A  major  advantage  of  TEDEIA,  besides  ranking  each  DMU  by  its  influence,  is 
that  it  can  help  paint  a  numerical  picture  of  the  environment  in  whidi  a  DMU  operates. 
As  each  tier  level  and  associated  production  surface  is  removed,  the  new  DEA  scores  for 
the  remaining  DMUs  can  be  calculated.  In  this  yr&y,  the  migration  of  the  DEA  scores 
for  a  particular  DMU  can  be  traced  through  a  series  of  tiers.  A  rapid  rise  in  the  scores 
may  indicate  that  the  DMU  is  in  a  region  with  a  relative  low  density  of  other  DMUs 
but  is  dominated  by  a  few  highly  efficient  ones.  A  slow  rise  in  the  score  may  indicate 
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that  the  DMU  is  surrounded  by  a  larger  density  of  other  DMUs  which  may  have  similar 
management  styles  or  environments  but  are  operating  more  efficiently.  This  information 
can  assist  the  analyst  and  managers  in  determining  appropriate  courses  of  actions  to 
improve  either  the  rank  ordering  or  the  efficiency  score. 

The  rank  ordering  procedure  may  also  prove  useful  in  window  analysis.  If  DMU 
behavior  is  tracked  over  time,  the  changes  in  rank  ordering  should  reflect  the  relative 
effectiveness  of  on-going  managerial  decision-making.  These  managerial  changes  may 
remain  hidden  from  traditional  DEA  analysis  unless  the  changing  practices  result  in  a 
change  in  efficient  or  inefficient  status  of  the  DMU.  Consequently,  the  rank  ordering 
methodology  may  provide  prompt  managerial  feedback  as  to  how  a  DMU  compares  with 
the  competition  as  a  result  of  implemented  changes. 

3.5.  Computational  Considerations 

It  is  important  to  note,  \inlike  CCR  or  BCC  models,  with  the  EDEA  method  the 
basic  elements  in  the  optimal  solution  need  not  represent  efficient  DMUs.  Figure  3.7 
depicts  such  a  situation.  When  B  is  removed  from  the  data  set,  the  resulting  effi¬ 
cient  surface  includes  E  and  F,  both  of  which  are  inefficient  according  to  a  traditional 
DEA.  Therefore,  the  advantage  of  computational  efficient  techniques  for  DEA,  such  as 
restricted  basis  entry  and  early  identification  of  efficient  DMUs,  cannot  be  maintained 
for  the  EDEA  models.  However,  other  computationally  efficient  procedures  are  possible 
for  the  EDEA  model.  Computational  efficiency  can  be  maintained  by  combining  TDEIA. 
and  EDEA  into  a  single  formulation,  TEDEA*,  based  on  the  following  observation. 

When  a  DMU  is  removed  from  tier  Vs  data  set,  the  resulting  production  surface 
will  consist  of  DMUs  belonging  to  either  tier  t  or  t+1. 

In  this  way,  the  stratification  of  DMUs  proves  to  be  a  valiiable  computational 
tool  for  the  EDEA  method.  Since  the  number  of  DMUs  at  tier  t  and  <  -i- 1  is  typically 
small  compared  to  the  entire  set,  the  linear  programming  problems  remain  small  when 
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determining  the  EDEA  scores.  As  a  result,  TDEA  and  EDEA  can  be  combined  to  form 
a  computationally  efficient  rank  ordering  method.  Let  and  Yf  be  the  matrices  of 
input  and  output  vectors  of  DMUs  belonging  to  tier  t  or  t+1.  Choose  a  DMU  from  tier 
t  for  analysis,  let  this  be  DMUo.  The  TEDEA*  model  can  be  written  as: 


(TEDEA*) 


min  6 

(3.12) 

II 

0 

1 

(3.13) 

dXo  -  x["U  -  s*  =  0 

(3.14) 

A,  s*,  s°  >  0 

(3.15) 

6  free 

(3.16) 

Because  the  TEDEIA*  problem  consists  of  LP’s  much  smaller  than  EDEA,  computational 
efficiency  is  maintained. 


3.6.  Additional  Benefits  of  Ranking 
As  mentioned  previously,  the  empirical  production  surface  of  the  DEA  model  is 
greatly  influenced  by  possible  outliers  in  the  data.  In  1971,  Timmer  [86]  assumed  a 
Cobb-Douglas  form  to  develop  a  probabilistic  frontier  production  function  for  observed 
data.  Using  the  Cobb-Douglas  form,  Timmer  translated  the  problem  into  a  linear 
programming  model  with  a  striking  similarity  to  the  DEA  model.  Since  he  assumed 
a  single  output,  multi-input  case,  he  was  able  to  compare  the  frontier  analysis  with  the 
traditional  econometric  model.  He  showed  that  by  eliminating  the  top  2%  of  observations 
which  appeared  to  be  outliers,  the  linear  programming  frontier  model  yielded  results  that 
could  be  supported  by  the  econometric  models. 

In  DEA,  no  functional  form  is  assmned.  Consequently,  potential  outliers  have 
bern  difficult  to  identify.  In  this  case,  the  efficient  DMUs  may  not  represent  normal 
production  activities  and  therefore  may  not  serve  as  a  legitimate  basis  of  comparison. 
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Consequently,  DEA  efficiency  scores  of  outliers  could  distort  the  true  picture  of  manage¬ 
rial  efficiency  of  the  inefficient  DMUs.  As  a  result,  identification  of  pc^ible  outher  or 
extremely  dominant  DMUs  is  essential.  With  TEDEA*,  the  outliers  will  be  located  on 
outer  tiers.  If  possible  outliers  are  revealed,  the  problematic  DMUs  can  be  removed  from 
the  analysis.  By  following  this  procedure,  a  better  estimation  of  managerial  efficiency 
can  be  achieved. 


3.7.  Conclusions 

This  chapter  has  outlined  a  rank  ordering  of  DMUs  based  on  the  influence  they 
serve  in  forming  empirical  production  layers  for  the  data  set.  The  intent  is  not  to  present 
the  only  valid  system  or  rank  ordering,  but  to  stimulate  thought  as  to  how  DEA  methods 
may  be  modified  to  make  the  results  more  meaningful  to  practitioners.  One  concern  with 
the  rank  ordering  approach  is  the  large  computational  requirement  necessary  to  achieve 
the  results.  On  some  layers  the  density  of  efficient  DMUs  will  be  high  and  on  other 
layers  the  density  may  be  low.  The  code  must  be  computationally  efficient  across  this 
wide  variety  of  conditions.  In  addition,  the  code  must  be  flexible  to  allow  for  constant 
or  variable  returns  to  scale  in  either  the  input  or  output  models.  FlexibiUty  to  switch 
between  models  at  difi'erent  layers  may  also  be  desirable.  The  TEDEA*  approach  meets 
the  requirements  to  effectively  and  efficiently  perform  such  tasks. 

The  rank  order  procedure  presented  in  this  discussion  classifies  DMUs  by  their 
ability  to  use  current  technology  to  efficiently  conduct  the  production  process.  This 
classification  can  serve  as  a  proxy  measure  of  management  efficiency  that  provides  more 
detailed  information  than  offered  by  standard  DEA  models.  By  identifying  the  best  and 
worst  performing  DMUs,  analysts  can  pinpoint  which  DMUs  should  serve  as  a  basis  of 
comparison  from  which  to  determine  which  management  practices  enhance  and  which 
deter  from  productive  efficiency.  In  addition,  the  rank  ordering  methodology  presented 
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here  reveals  the  need  to  further  characterize  the  computational  issiKS  of  DEA  and 
provide  efficient  codes  to  accomplish  these  new  tasks. 
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CHAPTER  IV 


DEA  SENSITIVITY  ANALYSIS 

4.1.  Motivation 

In  1978,  Chames,  Cooper,  and  Rhodes  (CCR)  [37]  developed  a  technique,  which 
they  named  data  envelopment  analysis  (DEIA),  to  determine  the  relative  efficiency  of 
a  production  unit  (commonly  known  as  a  decision  making  unit,  or  DMU)  compared 
to  other  units  using  the  same  kinds  of  inputs  to  produce  the  same  kinds  of  outputs. 
There  are  three  popular  linear  programming  formulations  which  indicate  whether  a  DMU 
is  relatively  efficient  or  inefficient.  The  CCR  model  [37]  assesses  whether  the  DMU 
is  efficient  at  the  most-productive  scale-size  (MPSS).  This  model  is  perhaps  the  most 
widely  used  in  DEA  applications.  The  Banker,  Cha  .-nes,  and  Cooper  (BCC)  model  [11] 
and  the  additive  model  [33]  determine  whether  the  DMU  is  Pareto-Koopmans  efficient. 
In  economic  contexts,  the  BCC  and  additive  models  find  the  entire  efficient  empirical 
production  surface  while  the  CCR  model  finds  a  proper  subset  of  that  siirface  which  is 
composed  of  efficient  units  at  the  MPSS.  For  a  single-output,  single-input  DMU  set,  the 
MPSS  is  the  size  that  yields  the  greatest  average  product  (i.e,  the  greatest  ratio  of  output 
to  input). 

Within  the  context  of  one  of  these  models,  a  DMU’s  status  is  either  efficient  or 
not  efficient.  To  determine  this  status,  a  separate  linear  program  must  be  solved,  either 
directly  of  implicitly.  Since  the  DEA  methodology  is  empirically  based,  values  of  the 
inputs  and  outputs  must  be  measured  or  estimated  at  a  given  point  in  time  for  each 
DMU.  A  concern  then  is  the  accuracy  of  the  measures  or  estimates,  the  comparability  of 
the  decision  units  (particularly  on  a  temporal  basis),  and  any  uncertainty  associated  with 
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the  observed  values.  Any  of  these  factors  can  affect  the  validity  of  an  efficiency  analysis 
and  influence  the  status  of  all  of  the  decision  units  under  consideration. 

Sensitivity  analysis  is  an  important  element  of  any  DEA  study  because  it  assesses 
the  rob\istness  of  each  DMU’s  status  when  variations  in  the  inputs  and  outputs  are 
possible.  This  is  more  than  a  simple  extension  of  traditional  LP  sensitivity  theory 
because  of  the  nature  of  the  DEA  models. 

Traditional  sensitivity  analysis  in  linear  programming  concentrates  on  finding  a 
new  optimal  solution  value  to  a  problem,  as  updated  estimates  of  some  of  the  sample 
data  become  available,  without  the  expense  of  resolving  the  problem  from  scratch.  This 
may  take  the  form  of  determining  ranges  within  which  data  may  be  varied  without 
requiring  a  change  in  the  set  of  vectors  composing  the  optimal  basis.  For  the  DEA  model 
formulation,  this  traditional  anal}r8is  is  inappropriate  because  a  change  in  the  values  of 
the  inputs  and  outputs  of  a  particular  DMU  simultaneously  affects  structural  and  right- 
hand-side  coefficients  for  all  LPs  in  the  analysis.  Additionally,  the  assumptions  of  non¬ 
degeneracy  of  the  optimal  basis  and  uniqueness  of  the  optimal  solutions  are  violated  for 
the  DEA  problems.  Also,  in  DEA,  the  primary  concern  is  the  effect  that  perturbations 
may  have  on  the  status  of  a  DMU,  not  simply  on  the  solution  value  of  the  underlying 
linear  programming  problem.  Consequently,  new  approaches  for  sensitivity  analysis  must 
be  developed  when  considering  DEIA  models. 

To  date,  a  number  of  studies  address  sensitivity  analysis  in  DEA  [33,36,43,44], 
most  with  respect  to  the  additive  model.  Chames,  Cooper,  Lewin,  Morey,  and  Rousseau 
[36]  present  a  limited  study  of  perturbations  to  a  single  output  or  single  input  using  the 
CCR  model.  This  study  was  extended  by  Chames,  Haag,  Jaska,  and  Semple  (CHJS)  [43] 
to  identify  regions  of  stability  using  the  additive  model,  in  which  allowable  perturbations 
cause  the  DMU  to  retsun  its  status.  Because  the  additive  model  flnds  the  Pareto- 
Koopmans  efficient  frontier  without  respect  to  an  input  or  output  orientation,  the  CHJS 
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example  is  restricted  to  simultaneous  perturbations  of  both  the  inputs  and  outputs  of 
each  DMU  in  the  model. 

This  chapter  presents  a  new  methodology  that  allows  for  sensitivity  analysis 
in  the  case  of  proportional  changes  to  either  the  outputs  alone  or  the  inputs  alone 
for  the  BCC  and  CCR  models.  The  approach  will  identify  a  range  of  values  such 
that  all  allowable  perturbations  for  the  DMU  within  the  range,  ceteris  paribus,  will 
preserve  the  DMU’s  current  status.  Since  the  analysis  can  be  conducted  in  conjunction 
with  determining  the  DEA  efficiency  score,  minimal  additional  computational  costs  are 
required.  Also  presented  is  an  update  to  the  CHJS  approach  that  overcomes  inherent 
problems  with  their  additive  model  approaches. 

The  remainder  of  this  chapter  is  divided  into  four  sections.  Section  4.2  devel¬ 
ops  the  framework  for  conducting  sensitivity  analysis  on  DEA  problems.  Section  4.3 
introduces  the  methodology  to  conduct  sensitivity  analysis  on  each  DMU  for  the  BCC 
and  CCR  models.  Although  the  sensitivity  analysis  is  based  on  proportional  changes 
in  either  outputs  or  inputs,  nonproportional  changes  will  also  be  investigated.  Section 
4.4  compares  the  methodology  presented  in  Section  4.3  to  the  CHJS  sensitivity  analysis 
approach.  A  problematic  concern  of  the  CHJS  method  is  presented  and  a  demonstration 
of  how  the  new  methodology  compliments  the  CHJS  to  meet  that  concern.  Section  4.5 
introduces  an  expanded  sensitivity  analysis  approach  based  on  the  models  introduced 
in  Section  4.3.  Although  the  expanded  sensitivity  analysis  may  be  computationally  in¬ 
tensive,  it  can  reveal  information  to  guide  the  management  in  determining  what  future 
actions  should  be  taken  to  maintain  or  establish  efficiency.  Conclusions  are  given  in 
Section  4.6. 
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4.2.  BCC  and  OCR  DEA  Sensitivity  Analysis 


A  new  method  for  sensitivity  analysis  in  DEA  can  be  constructed  from  simple 
modifications  to  the  CCR  and  BCC  DEA  formulations.  In  this  approach,  the  sensitivity 
of  a  DMU’s  status  is  measured  in  terms  of  either  proportional  changes  in  outputs  or 
inputs.  The  concerns  in  DEIA  sensitivity  analysis  differ  dramatically  between  DMUs 
which  are  efficient  and  those  that  are  inefficient.  For  efficient  DMUs,  an  increase  of  any 
output  caimot  result  in  a  lower  efficiency  rating;  likewise,  a  decrease  in  any  input  cannot 
worsen  the  DEA  efficiency  score.  Therefore,  with  efficient  DMUs,  primary  attention 
is  given  to  determining  what  reductions  in  outputs  are  allowable,  or  what  increases  in 
inputs  are  permissible  in  order  to  insure  the  DMU  retains  its  efficient  status.  For  efficient 
DMUs  one  is  concerned  with  identifying  how  bad  the  production  process  can  get  before 
the  DMU  loses  its  efficient  statiis. 

This  motivation  fundamentally  differs  for  the  inefficient  DMUs,  for  which  the 
primary  concern  is  the  identification  of  changes  that  will  improve  the  efficiency  rating. 
Generally,  one  would  fike  to  identify  those  areas  that  require  the  smallest  improvements 
which  would  cause  a  given  DMU  to  become  relatively  efficient.  For  inefficient  DMUs 
this  woffid  consist  of  either  increasing  outputs  or  decreasing  inputs.  We  now  present  an 
elegant  formulation  for  tmalyzing  both  efficient  and  inefficient  DMUs  in  the  CCR  and 
BCC  models. 


4.2.1.  CCR  and  BCC  Sensitivity:  A  Backgroimd 
For  the  sensitivity  analysis,  let  DMU*  be  the  DMU  of  interest.  In  this  model, 
we  define  D  =  {1, . . . , n},  the  index  set  of  DMUs,  with  X  =  [Xi, . . . , Xn]  and  Y  = 
[Yi , . . . ,  Yn\.  Vectors  (Xj,  Yj)  denote  the  observed  data  where  Xj  =  (xij, . . . ,  Xij, . . . ,  Xmj) 
is  a  column  vector  of  observed  input  values  with  Xij  >  0  (at  least  one  positive)  and 
Yj  =  (j/ij, . . . ,  yrj.  •  ■  • .  I/aj)  is  a  colunm  vector  of  observed  outputs  with  >  0  (at 
least  one  positive).  Let  be  the  X  matrix  with  the  X*  column  removed.  Similarly 
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define  yI*®!  to  be  Y  with  the  Yk  column  removed.  A  new  approach  called  extremal  DEA 
(EDEIA)  will  form  the  basis  for  the  DEA  sensitivity  analysis.  The  approach  allows  for 
both  an  input  and  output  oriented  analysis  and  can  be  used  within  the  CCR  and  BCC 
modeling  contexts.  The  CCR-based  EDEA  models  can  be  formulated  as  follows,  with 
ECCR*  and  ECCRf*  having  input  and  output  orientations,  respectively. 


{ECCRi) 


{ECCRD 


min  w  =  6 

(4.1) 

s.t.  Yf^U  -  s°  =  Yfc 

(4.2) 

eXk  -  X^A  -  s*  =  0 

(4.3) 

A,  «*, >  0 

(4.4) 

0  free 

1.5) 

min  w  =  ^ 

(4.6) 

s.t.  xWA+s*=Xife 

(4.7) 

<^Yfc-YWA  +  s°  =  0 

(4.8, 

A,  «*,  >  0 

(4.9) 

^free 

(4.10) 

The  BCC  versions  of  the  above  EDEA  models  —  EBCC*  and  EBCC°,  respectively  — 
can  be  obtained  by  appending  the  constraint:  lA  =  1. 

In  the  discussion  that  follows,  the  term  DEA  models  refers  to  the  original  CCR, 
BCC,  and  additive  models.  In  standard  DEA  form,  the  index  set  of  frontier-efficient 
DMUs  of  the  BCC  and  CCR  models  which  are  Pareto-Koopmans-efficient  is  designated 
as  E*  =  6  D|^*  =  1  (or  =  1),  a°*  =  0,  and  «**  =  0}.  The  set  F  is  the  index  set 

of  frontier-efficient  but  not  Pareto-Koopmans  efficient  DMUs,  where  F  =  {j  £  D\6*  =  1 
or  {<l>*  =  1)  and  ^  0  or  s**  ^  0.  For  a  DMU  to  be  Pareto-Koopmans  efficient,  it 
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must  not  be  possible  to  decrease  any  input  value  without  also  decreasing  at  least  one 
output  or  increasing  at  least  one  other  input  value.  In  addition,  to  be  Pareto-Koopmans- 
efficient,  it  must  not  be  possible  to  increase  any  output  value  of  DMUfc  without  also 
increasing  at  least  one  input  value  or  decreasing  at  least  one  other  output.  All  DMUs 
on  the  frontier,  but  which  are  not  Pareto-Koopmans  efficient,  i.e.,  the  members  of  F, 
are  labeled  weakly  efficient  Inefficient  DMUs  have  d*  <  1  for  the  input-oriented  model 
and  <(>*  >  1  for  the  output-oriented  model.  With  EDEA,  $*  and  values  for  inefficient 
DMUs  are  identical  to  their  corresponding  variables,  d*  and  <(>*,  in  the  DEA  models. 
However,  frontier  efficient  DMUs  in  EDEA  are  characterized  by  <9*  >  1  or  ^*  <  1.  To 
understand  this,  a  discussion  of  the  DEA  formulations  is  in  order. 

The  description  of  the  DEA  formulations  gives  insight  into  the  meaning  of  9* 
((!>*).  The  DEA  models  identify  the  set  of  efficient  DMUs,  E*,  as  those  fonning  an 
empirical  production  surface  which  envelops  all  DMUs  in  D,  the  set  under  investigation, 
111,31,33,39,41].  For  the  OCR,  BCG,  and  additive  models,  all  DMUs  in  D  are  evaluated 
against  the  DMUs  which  are  members  of  E*.  For  the  ECCR  and  EBCC  models,  the 
problem  reference  set  is  denoted  as  D  =  D— fc,  where  DMUfc  is  the  unit  under  evaluation. 
If  fc  ^  E*  U  F,  then  DMUfc  is  inefficient  and  9*  =  9*  or  =  (f>*.  However,  if  ik  6  E*  U  F, 
and  since  A:  0  D,  then  ^*  >  1  or  <  1.  This  results  because  DMUfc  will  compare  to  the 
remaining  frontier  formed  by  the  efficient  DMUs  of  D.  In  this  case  9*  can  be  interpreted 
in  terms  of  the  Farrell  measure  [51];  DMUjt  izt&y  increase  its  inputs  by  a  factor  of  and 
remain  frontier  efficient.  Correspondingly,  for  the  output-oriented  model,  DMUfc  may 
decrease  its  outputs  by  a  factor  of  and  remain  frontier  efficient. 

Chames,  Cooper,  and  Thrall  place  the  scale-efficient  DMUs  for  the  CCR  model 
into  three  categories  which  they  label  E,  E',  and  F  (41).  For  the  input-oriented  DEA 
model,  DMUfc  e  E  U  E/  U  F  if  and  only  if  =  1  (i.e.,  DMUfc  is  frontier  efficient). 
Mathematically,  the  members  of  E  are  extreme  rays  (CCR  model)  or  extreme  points 
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(BCC  model)  which,  when  taken  with  their  associated  facets,  form  a  piecewise-empirical 
production  surface,  or  Pareto-Koopmans  frontier  [33].  For  the  ECCR  and  EBCC  models, 
only  members  of  E  will  have  6*  >  1.  Members  of  E'  are  non-extreme  rays  (or  non¬ 
extreme  points)  but  depict  DMUs  which  fall  on  the  frontier  and  are  Pareto-Koopmans 
efficient.  For  the  ECCR  and  EBCC  models,  only  members  of  E/  will  have  (?*  =  1,  s*  =  0, 
and  s°  =  0  with  an  efficient  reference  set  consisting  of  DMUs  which  are  members  of 
E*.  DMUs  which  are  elements  of  F  represent  DMUs  on  the  frontier  but  which  are  not 
Pareto-Koopmans  efficient.  For  the  ECCR  and  EBCC  models  only  members  of  F  will 
have  6*  =  \  and  s*  ^  0  or  s°  ^  0  with  an  efficient  reference  set  consisting  of  DMUs 
which  are  members  of  E*.  It  is  important  to  note  that  if  DMUfc  is  a  member  of  F  with 
an  efficient  reference  set  containing  other  members  of  F  then,  for  the  ECCR  and  EBCC 
models,  the  optimal  solution  for  DMU*  may  result  in  0*  =  1,  s*  =  0,  and  8°  =  0.  In  this 
case,  members  of  E/  and  F  can  be  distinguished  if  the  efficient  reference  set  is  retained  in 
the  solution  of  ECCR  and  EBCC,  for  once  the  solution  for  all  DMUs  in  D  are  found,  all 
members  of  F,  E,  and  E*  can  be  identified.  Therefore,  the  two-phase  approach  suggested 
by  Ah  [3]  to  identify  members  of  F  is  not  necessary. 

Some  important  observations  can  be  made  concerning  the  above  classification 
scheme.  With  input-oriented  DEA  analysis,  the  transformation 

x*^e*Xk-8**,  +  (4.11) 

projects  DMUjt  onto  an  associated  Pareto-Koopmans  frontier-efficient  facet.  Correspond¬ 
ingly,  the  transformation 


XJi^Xk-8^*,  Yi:^(t>*Yk  +  8°* 


(4.12) 
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projects  DMUfc  onto  an  associated  Paieto-Koopmans  frontier-efficient  facet  for  the  output- 
oriented  models  [33].  In  either  case,  the  projected  DMU]^  consisting  of  (V]^,  is  called 
the  virtual  representation  of  DMUfc  [37] . 

The  virtual  representation  DMU]^  at  determines  an  efficient  projection 

that  is  a  member  of  E  U  E',  i.e.,  Pareto-Koopmans  efficient.  However,  other  transforma¬ 
tions  are  ]x>ssible  that  provide  important  insights.  The  transformation 

Yj^^Yk  (4.13) 

will  produce  an  efficient  projection  for  the  ECCF?  and  EBCC*  input-oriented  models, 
which  will  be  a  member  of  E  U  E'  U  F  (i.e.,  be  frontier  efficient).  Likewise,  for  the  output- 
oriented  models  ECCR?  and  EBCC**,  the  transformation 

Xl*-Xk,  Yj:*-^*Yk  (4.14) 

will  also  produce  an  efficient  projection,  DMUJ,  which  is  a  member  of  E  U  E'  U  F. 
Consequently,  the  primary  purpose  of  the  slacks  in  DEA  models  is  to  insure  that  the 
virtual  transformation  DMU]^  is  Pareto-Koopmans  efficient  and  not  simply  a  DMU  on 
the  efficient  frontier. 

However,  the  restriction  to  limit  projections  to  Pareto-Koopmans-efficient  points 
is  not  necessary  when  determining  the  allowable  perturbations  that  will  cause  a  DMU  to 
change  status,  for  any  weakly  efficient  DMU  that  has  all  inputs  or  aU  outputs  perturbed 
by  a  small  amount  will  become  either  Pareto-Koopmans  efficient  or  completely  inefficient. 
This  has  important  implications  for  the  EDEA  models  and  corresponding  sensitivity 
analysis.  For  the  ECCR*  and  EBCC*  input  models,  6*  determines  the  maximiiTn 
proportional  changes  in  all  inputs  of  DMUjt  which  allow  it  to  maintain  its  current  status. 
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For  ECCB^  and  EBCC^,  the  output  oriented  models,  0*  determines  the  nnucimnm 
proportional  changes  in  all  outputs  of  DMUjt  which  allow  it  to  maintain  its  current 
status.  Consequently,  these  facts  can  be  beneficial  in  detmnining  the  allowable  variations 
in  inputs  or  outputs  which  will  not  change  the  status  of  DMUj^. 

4.3.  CGR  and  BCC  Sensitivity  Procedures 
The  following  procedures  are  used  to  determine  the  maximum  allowable  changes 
to  inputs  and  outputs  before  a  DMU  changes  its  status.  The  procedures  differ  slightly  for 
the  OCR  and  BCC  input-  and  output-oriented  models.  The  procedure  must  be  applied 
to  each  DMU  in  the  set  in  which  sensitivity  analysis  is  desired. 

Set  C  =  -Xj  and  ri  =  Yj  of  the  DMU  j  under  consideration. 

Procedure  SensCCR*0,  *7)* 

1.  Set  Xj  ^  and  Yj  *-  rf. 

2.  Solve  ECCR). 

3.  Let  x;  ^  e*Xj. 

4.  AXj  =  XJ  -  Xj. 

Procedure  SensCCR®(j,  ly): 

1.  Set  Xj  ♦-  ^  and  Yj  *—tj. 

2.  Solve  ECCB^. 

3.  Let  r/  ^  ^*Yj. 

4.  AYj  =  y/  -  Yj. 

Procedure  SensBCC*C7,  C.  v)’ 

1.  Set  Xj  *-  C  and  Yj  *-  rf. 

2.  Solve  EBCCj. 

3.  If  EBCC^  is  infeasible  then: 

a.  DMUj  is  efficient. 
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b.  AXj  =  00. 

Else: 

a.  Let  X*j  ♦-  e*Xj. 

b.  AXj  =  X;  -  Xj. 

Procedure  SensBCC®(j,  Ci  *?)• 

1.  Set  Xj  *—  C  and  Yj  *-  ri. 

2.  Solve  EBCC^. 

3.  If  EBCCj  is  infeasible  then: 

a.  DMUj  is  efficient. 

b.  AYj  =  Yj. 

Else: 

a.  Let  Yf  ^  ^*Yj. 

b.  AYj  =  y/  -Yj. 

In  the  EDEA  input  models,  if  >  1,  then  DMU*  is  efficient,  and  the  maximum 
increase  of  inputs  that  allows  the  DMU  to  remain  efficient  is  AX^  =  0*Xk  —  Xk-  If 
<  1,  then  DMUfc  is  inefficient  and  AXk  determines  the  necessary  decrease  of  inputs 
to  cause  the  DMU  to  become  efficient.  Note  that  AXk  identifies  ranges  such  that  if  all 
perturbations  remain  less  than  the  AXk  values,  the  DMU  will  retain  its  current  status. 
Therefore,  the  effects  of  non-proportional  variations  of  the  inputs  on  the  DMU’s  status 
can  be  evaluated.  It  is  also  important  to  note  that  only  when  all  perturbations  exceed 
the  AXk  ranges  will  the  DMU  be  guaranteed  to  change  its  status.  If  even  one  input 
perturbation  is  smaller  than  the  allowable  range,  a  change  in  status  is  not  assiued  for  the 
DMU.  This  will  be  discussed  further  in  Section  4.4. 

A  similar  interpretation  can  be  made  concerning  the  EDEA  output-oriented  mod¬ 
els.  In  these  models,  allowable  proportional  changes  in  outputs  are  determined  by 
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Ay*  =  ^*y*  —  y*.  if  ^*  <  l,  DMU*  is  efficient  and  the  maximum  allowable  decrease  of 
outputs  is  given  by  Ay*.  For  0*  >  1,  DMU*  is  inefficient  and  AF*  identifies  the  required 
increases  of  the  outputs  of  DMU*  which  will  cause  the  DMU  status  to  change. 

In  the  BCC  model,  the  EDEA  linear  programming  formulations  may  be  infeasible 
for  the  DMU  of  interest,  DMU*,  in  either  the  input  or  the  output  model.  For  the  input- 
oriented  model,  this  would  mean  that  DMU*  is  producing  one  or  more  outputs  in  an 
amount  greater  than  any  other  DMU.  Consequently,  regardless  of  how  much  the  inputs 
of  DMU*  are  increased,  DMU*  wiU  remain  Pareto-Koopmans  efficient  at  its  own  scale 
level.  For  the  output-oriented  model,  an  infeasible  solution  indicates  there  is  at  least  one 
input  for  which  DMU*  consumes  less  than  any  other  DMU.  In  this  case,  outputs  can  be 
reduced  to  zero  and  the  DMU  will  remain  efficient  at  its  own  scale  level.  Because  of  their 
special  characteristics,  these  DMUs  will  be  termed  critically  efficient  because  eliminating 
them  from  the  data  set  critically  alters  the  feasible  region.  Pre-processing  of  the  data  can 
reveal  these  DMUs  prior  to  performing  any  sensitivity  analysis.  It  is  important  to  note, 
that  the  problem  of  infeasibility  is  unique  to  the  BCC  model  and  not  the  CCR  model 
because  of  the  feasible  regions:  the  feasible  region  of  the  BCC  model  is  a  closed  convex 
set,  whereas  the  feasible  region  of  the  CCR  model  is  a  convex  cone. 

The  advantage  of  EDEA  over  previous  models  for  conducting  sensitivity  analysis 
is  that  it  separates  the  effects  of  perturbations  of  inputs  from  those  of  the  perturbations 
of  outputs.  In  general,  the  results  are  only  meaningful  for  measuring  effects  of  pertur¬ 
bations  in  inputs  alone,  or  outputs  alone,  but  not  both  simultaneously.  Additionally, 
the  method  requires  few  computational  resources  and  can  be  used  simultaneously  with 
determining  the  desired  DEA  efficiency  scores. 
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4.4.  Additi\«  Model  Sensitivity  Analj^sis 
Chames,  Haag,  Jaaka  and  Semple  [43]  (CHJS)  presented  the  following  procedtire 
for  performing  sensitivity  analysis  in  DEA  based  on  the  additive  model.  Given  the 
empirical  points  (Xj,yj),  j  =  with  input  vector  Xj  >  0  with  at  least  one 

Xij  positive  and  output  vector  Vj  >  0  with  at  least  one  yrj  positive,  Chames,  et.  aJ.  [43], 
define  the  empirical  production  set  as 

{n  n 

yGr,x€5r|(y,x)  =  53Ai(yi,XO;  ^Xi  =  l,  Ai  >  0 

t=i  t=i 

where  A*  are  real  scalar  coefficients. 

The  efficiency  of  a  DMU  is  determined  by  comparing  its  vector  of  outputs  and 
inputs  with  the  empirical  production  set.  The  DMU  of  interest  is  efficient  if  its  vector  is 
Pareto-Koopmans  efficient  with  respect  to  PE. 

A  unique  featme  of  the  additive  model  is  that  it  does  not  require  an  input  or 
output  orientation.  The  additive  model  is  given  as: 


{ADDk)  max  z  =  ls°  +  Is*  (4.15) 

s.t.  YA-s°=Yfc  (4.16) 

XA  +  s*  =  Xfc  (4.17) 

lA  =  1  (4.18) 

A,s°,s*>0.  (4.19) 


DMUjfc  is  efficient  if  and  only  if  z*  =  0  at  optimality.  As  in  all  DEA  formulations,  with 
this  method,  a  hnear  program  must  be  solved  for  each  DMU  tested. 

The  purpose  of  the  CHJS  paper  is  to  present  a  new  procedure  to  conduct  sen¬ 
sitivity  analysis  for  a  specific  class  of  DEIA  models  which  allows  for  a  simultaneous 
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proportional  change  in  outputs  and  inputs.  For  each  DMU,  CHJS  calculate  a  “region 
of  stability,”  that  is,  a  symmetric  cell  such  that  all  perturbations  within  the  cell  maintain 
the  DMU’s  status.  The  shape  of  the  cell  is  affected  by  the  norm  used.  The  authors 
present  an  analysis  based  on  the  Loo  norm  as  well  as  the  Li  norm.  Only  the  Loo  norm 
will  be  discussed  since  the  same  conclusions  can  be  reached  for  the  Li  norm. 

To  vmderstand  the  CHJS  procedure,  a  few  definitions  are  necessary.  Given 

DMU*,  the  radius  of  stabihty  is  the  largest  number,  rp,  such  that  arbitrary  perturbations 

to  DMUjfc  with  p-norm  strictly  less  than  Vp  preserve  the  DMU’s  current  state.  The  Loo 

norm  is  defined  as  ||z|ioo  =  max  \zi\. 

i 

A  characteristic  of  the  CHJS  approach  is  that  the  analysis  must  be  carried  out 
differently  for  efficient  DMUs  and  inefficient  DMUs.  In  the  CHJS  approach,  a  procedure 
is  needed  to  first  determine  the  efficiency  status  of  each  DMU.  Once  this  status  is 
determined,  sensitivity  analysis  can  be  conducted  on  each  DMU  with  an  appropriate 
procedure,  depending  if  the  DMU  is  efficient  or  inefficient.  Note,  with  the  EDEA 
formulation,  computational  efficiency  is  attained  because  one  process  determines  the 
efficiency  status  and  the  necessary  information  to  conduct  sensitivity  analysis  for  each 
DMU.  The  CHJS  sensititivity  analysis  can  be  described  as  follows.  For  efficient  DMUfc, 
the  authors  describe  the  Looc  model  as: 


(Looc-norm  Model)  min  0  (4.20) 

s.t.  Y^A  -  +  10  =  Yfc  (4.21) 

xWA  +  s<-10  =  Xfc  (4.22) 

lA  =  1  (4.23) 

A,s°,sS0>O.  (4.24) 


If  DMUjfc  is  inefficient,  the  Loot  model  must  be  used.  It  is  written  as: 
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(Loo*-norm  Model)  max  d  (4.25) 

s.t.  yI^'U  -  -  1^  =  Y*  (4.26) 

+  s*  +  l&  =  X;,  (4.27) 

lA  =  1  (4.28) 

X,3°,s\0>O.  (4.29) 


The  Looc  model  determines  the  maximum  permissible  decrease  in  any  output  with 
a  simmcaneous  increase  in  any  input  before  the  efficient  DMU  becomes  inefficient.  On  the 
other  hand,  the  Loot  model  finds  the  required  decrease  of  inputs  along  with  an  increase 
in  outputs  that  the  inefficient  DMU  must  witness  before  it  changes  status  and  becomes 
efficient. 

The  authors  present  proofs  demonstrating  that  0*  for  both  models  is  the  radius 
of  stability  roo-  Having  determined  0*,  the  authors  claim  that  allowable  perturbations  to 
an  efficient  DMU  of  the  form 


where  r  €  [0, 0*),  preserve  efficiency  for  the  DMU.  No  corresponding  analysis  is  presented 
for  the  inefficient  DMUs,  but  the  allowable  perturbations  for  the  inefficient  DMUs  would 
take  the  form 

where  r  €  [0,  (9*).  Only  when  t  >  0*  will  a  change  in  status  be  assured  for  the  inefficient 
DMU. 
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The  authors  claim  that  the  projected  points  found  when  t  =  0*  always  form  an 
unstable  point,  and  therefore  prove  the  region  of  stabihty  is  valid.  A  point  {Yj,Xj)  is 
unstable  if  and  only  if  for  all  c  >  0,  the  open  ball  centered  at  {Yj,  Xj)  with  radius  c  con¬ 
tains  efficient  and  inefficient  points  with  respect  to  the  associated  empirical  production 
possibihty  set. 

4.4.1.  Sensitivity  Analysis  Example 

The  results  of  the  CHJS  approach  will  now  be  compared  with  the  ECCR*, 
ECCR°,  EBCC*  and  EBCC°  models’  results  using  the  data  presented  in  the  CHJS 
paper.  In  this  example,  there  are  three  DMUs,  each  with  one  output  and  two  inputs,  as 
shown  in  Table  4.1.  The  results  of  an  analysis  of  efficient  DMUyi  are  given  in  Table  4.2. 


Table  4.1.--  Sensitivity  analysis  example 


DMU^ 

DMUb 

DMUc 

LOO 

0.25 

1.00 

1.00 

^1 

1.00 

1.00 

0.25 

Table  4.2.  —  Comparative  sensitivity  results 


EDEAi 

CHJS 

MsUIsQM 

ECCR° 

EBCC° 

AyiA 

— 

— 

-0.643 

Axia 

infeas. 

— 

■HI 

infeas. 

— 

0.750 

To  make  proper  comparisons  between  the  EDEA  models  and  the  CHJS  approach, 
the  EBCC  models  should  be  observed.  The  infeasibiUty  of  the  EBCC*  model  indicates 
that  DMUx  is  efficient  at  its  own  scale  size,  therefore,  the  inputs  can  be  increased 
indefinitely  and  the  DMU  will  remain  efficient.  The  EBCC°  model  indicates  how  far 
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the  output  can  be  reduced  for  DMUyi  to  remain  efficient.  In  this  case,  as  long  as 
Via  >  0>25,  DMUyi  will  remain  efficient.  With  the  information  from  both  the  input 
and  output  models,  one  could  conclude  that  simultaneous  changes  can  occur  in  both  the 
inputs  and  outputs  and  DMU^i  will  remain  efficient.  As  long  as  yi^  >  0.25,  inputs 
can  increase  indefinitely  and  DMUy4’s  status  will  remain  unchanged.  Notice,  the  CHJS 
approach  yields  similar,  although  more  limited,  results.  The  CHJS  results  indicate  that 
the  inputs  and  outputs  can  simultaneously  be  perturbed;  as  long  as  yi^  >  0.25  and 
xiA  <  1-75,  and  X2A  <  1-75,  DMUa  wiU  be  efficient.  Compared  to  the  EDEA  models,  it 
is  evident  that  the  CHJS  results  place  an  artificial  limit  on  the  allowable  perturbations 
of  the  inputs. 

An  additional  benefit  of  the  EDEA  sensitivity  analysis  approach  is  that  the 
ECCR*  and  ECCR?  models  reveal  information  on  the  perturbations  which  allow  DMU^i 
to  remain  scale  efficient.  Information  on  any  aspects  of  scale  efficiency  are  unavailable 
with  the  CHJS  approach. 

To  compare  the  CHJS  approach  to  the  ECCR^  and  ECCR°  models,  a  simple 
scaling  of  the  data,  so  that  all  DMUs  are  at  the  same  scale  size,  is  useful.  Since  there  is 
only  one  output,  this  scaling  can  be  accomplished  by  dividing  all  data  for  each  DMU  by 
its  output  value.  The  normed  data  are  presented  in  Table  4.3. 

Since  all  DMUs  produce  a  single  output  at  the  same  level,  a  unit  isoquant  plot  of 
the  data  is  possible  as  depicted  in  Figure  4.1.  In  this  example,  DMU  A  is  efficient,  B  is 
weakly  efficient,  and  C  is  inefficient. 


Table  4.3.  --  Normed  data  example 


DMU^ 

DMUs 

DMUc 

1.00 

1.00 

1.00 

1.00 

1.00 

4.00 

WEm 

1.00 

4.00 

2.00 
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Figure  4.1.  Normalized  data  isoquant  plot 


Table  4.4.  --  Normed  data  sensitivity  results 


EDEA 

CHJS 

mminm 

EBCC* 

ECCR° 

m^zTSTtum 

^y\A 

— 

— 

infeas. 

Axia 

— 

— 

■flRiiiW 

AX2A 

BEiSi 

— 

— 

1.800 

Results  for  the  EDEA  and  CHJS  sensitivity  models  are  given  in  Table  4.4.  The 
infeasibility  of  the  EBCC°  model  indicates  that  reductions  in  output  are  possible;  as 
long  as  output  is  greater  than  zero,  the  DMU  will  remain  Pareto-Koopmans  efficient. 
Note  however,  if  yiA  <  (1  —  0.643)  =  0.357,  DMUyi  will  no  longer  remain  scale  efficient. 
The  ECCB}  input-oriented  models  indicate  that  as  long  as  Ya  is  held  fixed,  inputs  can 
be  increased  to  2.8  and  DMU  A  will  remain  both  scale  and  Pareto-Koopmans  efficient. 
In  fact,  when  both  the  ECCW  and  ECCBF  models  are  paired,  one  would  conclude 
that  simultaneous  perturbations  of  both  outputs  and  inputs  are  possible.  As  long  as 
inputs  remain  less  than  2.8,  output  can  be  reduced  to  0.357  and  DMU^  will  remain  scale 
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efficient.  DMUyi  will  remain  Pareto-Koopmans  efficient  even  if  output  is  reduced  to  0,  as 
long  as  inputs  remains  less  than  2.8. 

The  results  of  CHJS  models  reveal  a  potential  problem  with  their  approach. 
The  results  indicate  as  long  as  the  inputs  remain  less  than  2.8,  DMU  A  will  remain 
Pareto-Koopmans  efficient,  even  with  simultaneous  reductions  in  outputs.  Further,  A 
will  remain  efficient,  according  to  CHJS,  even  if  output  is  reduced  to  yiA  ~  —0.800.  This 
clearly  is  not  true.  The  causes  of  the  problem  can  be  identified.  The  authors  claim  the 
projection 


*.2 

* 

1 

0 

Sh 

1 _ 

■-0.8' 

= 

Xlo  +  0* 

= 

2.8 

.^2o. 

X2o  + 

2.8 

is  an  unstable  point.  The  projection  must  be  an  unstable  point  for  the  region  of  stability 
to  be  legitimate.  Generally,  it  is  assumed  that  at  least  one  output  must  be  positive 
in  efficient  production  analysis,  such  as  DEA.  Therefore,  the  projected  point  cannot 
be  unstable.  In  fact,  the  projected  point  cannot  be  a  valid  member  of  the  modified 
empirical  production  possibility  set  PE.  Consequently,  althoxigh  the  CHJS  approach 
provides  important  insights  into  sensitivity  analysis  for  DEA,  there  are  special  cases  that 
are  problematic  with  that  approach. 

4.5.  Expanded  Sensitivity  Analysis 

An  attractive  feature  of  the  EDEA  model  is  its  simple  implementation  using 
existing  DEA  methodologies.  However,  a  concern  of  this  approach  is  that  the  resulting 
region  of  stability  may  be  a  small  subset  of  the  true  allowable  perturbations  for  the  DMU 
to  maintain  its  status.  Figure  4.2  illustrates  this.  The  shaded  area  above  points  A,  E, 
E'  and  below  points  A,  B,  C,  D  represents  the  allowable  perturbations  for  which  DMU 
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Figure  4.2.  Total  region  of  stability 

E  remains  efficient.  The  EDEA  method  reveals  a  region  of  stability  represented  by  the 
shaded  box. 

A  larger  region  of  stability  is  revealed  by  repeated  application  of  the  EDEA 
methodology,  as  described  below.  Procedures  ESA-I  and  ElSA-0  characterize  a  series  of 
stable  regions  for  changes  in  the  input  and  output  vectors,  respectively,  for  a  given  DMU 
j  to  the  degree  of  refinement  desired.  Recall  that  R  =  {1, . . . ,  s}  and  Q  =  {1, . . . ,  m} 
are  the  respective  index  sets  for  output  and  input  measures.  Initially,  let  C  =  and 
rj  =  Yj. 

Procedure  ESA-I(j,  C.  v)- 

1.  Apply  procedure  Sen8CCR*(j,  C.  v)  or  SensBCC*(7,  C,  v)  to  compute  . 

2.  If  the  region  of  interest  is  not  fully  characterized,  then: 

For  all  9  €  Q:  solve  ESA-I(j,  +  ^^qj) 

Procedure  ESA-0(j,C»»7): 

1.  Apply  procedure  Sen8CCR®(j,  C. »?)  or  SensBCC®(j,  C,  i?)  to  compute  AY^. 

2.  If  the  region  of  interest  is  not  fully  characterized,  then: 

For  all  r  e  R:  solve  ESA-0(j,  C>  +  ^r^Vrj) 
where  ejfc  is  a  column  vector  of  zeros  with  a  1  in  the  row. 
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Figure  4.3.  Expanded  sensitivity  analysis 


Figure  4.3  reveals  how  one  iteration  of  this  procedure  expands  the  region  of 
stabiht>  for  an  efficient  DMU,  E.  Because  this  DMU  contains  only  one  output,  only  the 
ESA-I  portion  of  the  algorithm  is  relevant.  Notice,  the  ESA  algorithm  can  be  appUed 
iteratively  to  yield  larger  regions  of  stability  if  desired.  However,  the  number  of  linear 
programs  required  to  trace  larger  regions  grows  rapidly.  Table  4.5  shows  the  input  and 
output  values  for  the  DMUs  depicted  in  Figure  4.3. 


Table  4.5.  --  Expanded  sensitivity  example  results 


DMUyi  DMUb 

DMUc 

DMUz?  DMUb 

DMUb* 

DMUb;  DMUbj 

yij 

1.00 

1.00 

1.00 

1.00  1.00 

Xlj 

7.00 

1.50 

3.25 

5.84  2.00 

X2j 

2.50 

1.50 

3.25 

2.69  4.33 

The  results  of  the  expanded  analysis  now  yield  greater  information  than  the 
EDEA  sensitivity  analysis.  With  straight  EDEIA  sensitivity  analysis,  results  indicate 
that  for  DMU  E,  both  inputs  can  be  increased  from  1.5  to  3.25  before  the  DMU  becomes 
inefficient.  However,  the  expanded  analysis  yields  more  detailed  information.  DMU  Ef 


indicates  that  as  long  as  X2E  ^  2.69,  xie  can  be  increased  to  5.84  and  E  will  remain 


efficient.  Likewise,  DMU  E2  indicates  X2E  can  be  increased  to  4.33  as  long  as  xie  <  2.00. 
As  can  be  seen  in  this  example,  the  expanded  sensitivity  analysis  reveals  a  greater  region 
of  stability  in  which  non-proportional  increases  in  inputs  can  be  witnessed  while  the 
DMU  maintains  its  status.  This  same  analysis  can  apply  to  an  inefficient  DMUs  as 
depicted  by  DMU  I  in  Figure  4.4. 


Figure  4.4.  Inefficient  DMU  expanded  sensitivity  analysis 

Because  both  input  and  output  models  must  be  solved  for  each  DMU,  the  number 
of  linear  programs  required  to  trace  the  expanded  region  grows  rapidly.  For  each  DMU, 
(s  +  m)  additional  LPs  must  be  solved  for  each  additional  increase  in  the  region  of 
stability  that  is  measured.  Note  however,  the  CHJS  Li  norm  technique  for  inefficient 
DMUs  also  requires  (s  +  m)  LPs  to  be  solved. 

4.6.  Summary 

The  motivation  for  the  sensitivity  analysis  for  the  efficient  DMUs  difiers  substan¬ 
tially  from  that  for  the  inefficient  DMUs.  The  managers  of  the  efficient  DMUs  desire 
to  know  how  much  loss  in  productive  efficiency  is  allowable  before  the  DMU  becomes 
relatively  inefficient  compared  to  its  peers.  The  inputs  that  permit  only  small  increases 
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before  efficiency  is  lost  play  a  different  role  in  the  production  process  t.h«.Ti  the  inputs 
which  can  suffer  large  perturbations  and  not  result  in  a  change  of  state.  This  infonnation 
can  give  the  manager  a  sense  of  the  relative  worth,  or  price  value,  of  each  input.  In  this 
way,  managers  can  plan  for  futxire  markets  by  shifting  to  production  technologies  that 
provide  greater  regions  of  stability.  Prom  the  output-oriented  model,  the  manager  is  able 
to  gain  a  more  complete  picture  of  the  role  of  each  output.  By  knowing  which  outputs 
are  most  critical  to  maintain  relative  efficiency,  the  manager  is  more  able  to  evaluate  the 
opportunity  costs  associated  with  producing  each  output. 

For  the  inefficient  DMUs,  the  manager  wants  to  identify  the  smallest  region  of 
stability.  This  will  reveal  how  best  to  allocate  resources  to  move  to  efficiency.  Instead  of 
down-sizing  across  all  inputs  with  proportional  reductions,  the  manager  may  be  able  to 
focus  on  reducing  those  resources  that  will  more  rapidly  bring  the  DMU  to  the  efficient 
frontier.  Likewise,  by  focusing  on  output  expansion,  the  manager  may  be  able  to  identify 
a  market  niche  with  a  few  of  outputs  in  which  the  DMU  can  achieve  relative  efficiency. 

The  sensitivity  analysis  presented  in  this  chapter  presents  the  manager  with 
useful  information  about  the  role  each  output  and  input  variable  plays  in  determining 
the  efficiency  of  the  production  process.  In  this  maimer,  the  manager  can  concentrate 
attention  on  those  factors  which  can  more  quickly  result  in  increased  productive  efficiency 
when  considering  pmlicy  decisions. 


118 


9 


CHAPTER  V 


SUMMARY  AND  CONCLUSIONS 

This  dissertation  contributes  to  the  development  of  solution  techniques  for  solving 
large-scale  data  envelopment  analysis  problems  which  involve  the  solution  of  thousands 
of  linear  programs.  Although  the  focus  is  on  large-scale  problems,  the  new  algorithms 
presented  can  be  used  to  effectively  solve  a  wide  range  of  DEA  problems. 

The  analysis  presented  in  Chapter  II  describes  a  new  problem  decomposition 
procedure  which  dramatically  expedites  the  solution  of  the  computationally  intensive 
DEA  problems  and  fully  exploits  parallel  processing  environments.  Computational  ex¬ 
periments  indicate  that  by  combining  linear  programming  theory  with  known  character¬ 
istics  of  the  DEA  formulations,  solution  times  can  be  cut  in  half  compared  to  general 
linear  programming  approaches.  When  utilizing  a  parallel  processing  environment,  the 
efficiency  remains  above  0.97  indicating  that  the  solution  time  will  improve  by  the  same 
magnitude  as  the  number  of  processors  used.  A  new  decomposition  algorithm  is  pre¬ 
sented  which  causes  a  6-  to  10-fold  improvement  in  the  solution  time  for  the  serial  case. 
When  the  decomposition  procedure  is  coupled  with  the  parallel  processing  environment, 
solution  times  can  be  increased  by  up  to  two  orders  of  magnitude. 

Besides  effectively  reducing  the  time  to  solve  large-scale  DEA  problems,  the  de¬ 
composition  approach  offers  many  additional  advantages  over  prior  DEA  solution  meth¬ 
ods.  It  allows  for  the  solution  of  a  large  number  of  different  DEA  formulations  in  rapid 
succession.  The  decomposition  approach  opens  a  new  means  of  conducting  window  anal¬ 
ysis  as  well  as  studying  categorical  variables.  Additionally,  the  decomposition  approach 
helps  avoid  problens,  such  as  cycling,  which  frequently  occur  in  larger  DEA  problems. 
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Building  on  these  improvements  to  the  DEA  solution  process,  Chapter  III  outlines 
a  new  rank  ordering  procedure  to  classify  DMUs  by  their  ability  to  apply  current  technol¬ 
ogy  to  the  production  process.  This  new  ranking  procedure  acts  as  a  proxy  to  measure 
managerial  effectiveness  in  a  way  not  possible  by  traditional  DEA  methods.  Best  and 
worst  performing  DMUs  are  easily  classified  with  the  new  ordering  procedure.  In  this 
way,  the  analyst  can  compare  the  management  practices  of  the  DMUs  from  both  clas¬ 
sifications  to  assist  in  determining  which  policies  improve  and  detract  from  productive 
efficiency. 

In  Chapter  IV,  we  presented  a  computationally  intensive  means  of  conducting 
sensitivity  analysis  on  the  DEA  results.  In  the  chapter,  the  new  sensitivity  analysis 
procedure  is  compared  with  previous  approaches.  The  new  procedure  is  shown  to  avoid 
problems  that  may  arise  with  the  previously  reported  methods.  The  key  advantage 
of  the  sensitivity  analysis  is  that  it  helps  clarify  which  factors  can  cause  significant 
improvements  to  the  production  process. 

The  study  of  large-scale  DEA  problems  reveals  many  new  issues  that  have  gone 
imnoticed  with  the  study  of  smaller  problems.  The  insights  gained  by  investigating  these 
new  problems  can  continue  to  result  in  new  analytical  tools  to  broaden  the  usefulness  of 
DEA.  The  techniques  presented  in  this  dissertation  open  the  door  to  new  research  which 
will  further  expand  the  range  of  applications  for  data  envelopment  analysis. 
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APPENDIX  A 


MULTINORMAL  DATA  GENERATOR 

Since  few  large-scale  DEA  problems  are  currently  available,  randomly  generated 
data  sets  were  needed  to  simulate  possible  real  life  data.  One  procedure  to  generate  such 
data  is  to  draw  the  samples  from  a  multinormal  distribution.  To  this  end,  the  DRNMVN 
routine  from  the  IMSL  hbrary  was  used  to  generate  data  for  10,000  DMUs.  The  variance 
covariance  matrix  of  Table  A.l  was  used  to  generate  the  data  for  the  input  and  output 
variables  for  each  DMU.  The  code  to  generate  this  data  is  described  by  Hickman[62]. 


Table  A.l.  —  Variance-Covariance  matrix  for  multinormal  data 


[_ 

Xi 

X2 

X3 

X4 

X5 

X6 

X7 

Xg 

Qa 

303.4 

2.5 

2.5 

12493.9 

6093.9 

112750625.0 

X4 

4062.6 

484.2 

-2890405.7 

31033992.2 

X5 

265.2 

45.3 

-1346076.8 

-455158.7 

7629765.5 

X6 

19597.2 

-195.0 

3436081.9 

-270405.4 

-674642.7 

86492323.0 

X7 

36418.8 

6428.4 

111950224.4 

27415022.3 

5153887.3 

88983356.8 

233502490.7 

Xg 

3647.0 

1001.0 

10815783.1 

112293.8 

-64824.7 

-  319883.0 

10543369.3 

7812141.0 

From  the  randomly  generated  data,  the  variable  representing  inputs  and  outputs 
needed  to  be  carefully  chosen.  To  coincide  with  sound  economic  theory,  all  outputs 
should  be  positively  correlated  with  all  the  inputs.  In  Fig.  A.l,  all  negatively  correlated 
variables  are  connected  with  a  line. 

By  eliminating  variable  X4,  variables  xi,  X3,  and  X7  could  represent  outputs 
since  they  would  be  positively  correlated  with  all  the  other  remaining  variables  that 
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would  represent  inputs.  In  this  way,  each  DMU  would  be  comprised  of  3  outputs 
and  4  inputs.  Notice  also  that  inputs  positively  correlated  would  be  comphments  in 
the  production  process  and  those  negatively  correlated  would  be  substitutes.  Since  the 
input  variables  cover  a  wide  range  of  compliment  and  substitute  relationship  cases,  the 
randomly  generated  data  further  simulates  possible  real  life  data. 
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APPENDIX  B 


DEA-GEN 

While  random  data  generators  can  provide  large  problems,  a  more  systematic 
approach  is  need  to  represent  real  hfe  scenarios.  To  this  end,  economic  theory  was  used 
to  generate  large-scale  DEA  problem  sets.  In  production  economics,  the  most  widely  used 
functional  form  is  known  as  the  Cobb-Douglas  production  function  [24].  This  function  is 
written  as: 

m 

Vj  —  do  X^j  ,  Xij  >  0,  ^  =  1, .  . . ,  71 
*=1 

Here,  yj  is  the  single  aggregate  output  produced  by  DMUj,  Xij  is  the  value  of  input  i 
used  by  DMUj  in  the  production  process,  Oi  is  the  elasticity  factor  for  input  i,  and  a© 
is  a  constant  scale  factor.  If  ct*  =  1  then  only  constant  returns  to  scale  exist  in 
the  production  process.  For  «*  <  1  decreasing  returns  to  scale  are  present,  while 
SSi  Oi  >  1  indicates  increasing  returns  to  scale.  In  the  DEA  studies,  increasing  returns 
to  scale  are  not  used  because  the  function  results  in  only  a  few  DEA  efficient  points. 
Although  this  does  not  pose  a  problem,  it  does  not  realistically  represent  true  life  data. 

For  the  single  output  model,  this  production  function  has  many  desirable  prop¬ 
erties.  If  the  inputs  are  randomly  generated,  the  function  generates  output  values  that 
will  always  lie  on  the  production  possibility  frontier,  i.e.,  th^  will  be  DEA  efficient  for 
^  1-  This  frontier  is  central  to  the  theory  of  economic  growth  and  measures 
the  rate  of  technological  progress.  The  Cobb-Douglas  frontier  represents  the  best  use 
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of  technology  as  well  as  the  best  management  practices  to  achieve  efficient  production. 
This  coincides  with  the  practical  use  of  DEA.  Unlike  DEA,  the  quest  in  economic  studies 
is  to  attempt  to  estimate  the  values  of  aj  by  fitting  the  function  to  the  observed  data. 
By  choosing  the  values  a  priori  and  then  randomly  generating  the  input  values,  the 
output  values  can  be  determined  so  that  they  coincide  with  widely  accepted  economic 
theory.  To  insure  that  not  all  generated  data  sets  fall  on  the  efficient  frontier  surface, 
the  Oo  scale  factor  can  be  randomly  generated.  The  efficient  DMUs  will  consist  of  those 
generated  where  Oo  takes  on  it  maximum  value.  Control  over  the  number  of  DMUs  that 
are  efficient  in  the  data  set  can  be  maintained  by  limiting  the  number  of  DMUs  generated 
with  maxuo. 

In  DEA  analysis,  the  single-output,  multiple-input  scenario  is  not  of  primary 
interest.  DEA  was  developed  to  anal)rze  the  case  of  multiple-output,  multiple-input 
studies.  Fare  and  Grosskopf  [54]  point  out  that  the  existence  of  a  joint  production 
function  has  not  been  established.  That  is,  the  multiple-output,  multiple-input  model 
does  not  produce  values  strictly  on  the  efficient  production  frontier.  To  do  so,  would 
require  strict  assumptions  that  would  not  provide  the  desired  realism  for  the  DEA  study. 
However,  a  joint  model  without  the  strict  assumptions  would  simulate  economically 
sound  production  processes.  Consequently,  a  joint  model  was  developed  for  the  DEA- 
GEN  problem  generator. 

For  each  of  the  problem  sets,  the  input  veilues  were  generated  from  a  uniform 
distribution.  The  Oj  values  were  chosen  to  simulate  constant  returns  to  scale  processes  as 
well  as  a  variety  of  cases  representing  different  levels  of  decreasing  returns  to  scale.  The 
Oo  value  was  randomly  generated,  but  with  a  modest  control  of  the  number  of  efficient 
DMUs  that  could  be  present  in  the  problem  data. 

Once  the  single  aggregate  output  level  was  calculated,  the  individual  output  levels 
were  determined  by  assigning  each  individual  output  as  a  percentage  of  the  aggregate. 
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The  percentages  for  each  individual  output  were  drawn  from  normal  distributions  with 
predetermined  means  and  standard  deviations.  The  means  of  the  normal  distributions 
were  chosen  so  that  the  percentages  sum  to  one.  Table  B.l  hsts  the  aj  values  as  well 
as  the  means  and  standard  deviations  that  were  used  to  generate  twelve  different  Cobb- 
Douglas  data  sets. 


Table  B.l.  --  Parameter  values  for  DEA-GEN  code 


#DMUs 

a  Values 

Mean  Values 

Steindard  Dev. 

DEA-GENa 

.3,.2,.2,.3 

.12,.13,.13,.14,.ll 

.03,.04,.02,.01,.01 

DEA-GENa 

.3,.2,.2,.2,.l 

.20,.30,.15 

.03,.04,.02 

DEA-GENa 

wmi 

.3,.3,.2,.2 

.12,.12,.15,.14 

.03,.04,.02,.01 

DEA-GENa 

.4,.2,.l,.l,.l 

.5 

.1 

DEA-GENb 

1,000 

.03,.07,.12,.03,.12,.1,.02 

01, .05, .01 

DEA-GENb 

2,000 

.l,.l,.l,.l,.l,.l,.l 

.03, .04 

DEA-GENb 

4,000 

.4,.3,.l 

.13,.08,.02 

DEA-GENb 

.3,.17,.12,.03,.12,.2 

.01,.05,.01,.01 

DEA-GENc 

.18,.2,.14,.l,.l,.l 

.2,.12,.14,.ll,.ll 

.03,.04,.02,.01..01 

DEA-GENc 

.13,.18,.2,.14,.l,.l 

.2,.18,.15,.19 

.03,.04,.02,.01 

DEA-GENc 

.13,.2,.12,.13 

.2,.3,.15,.14,.ll 

.03,.04,.02,.01,.01 

DEA-GENc 

.08,.12,.21,.07,.ll 

.3,.3,.3,.l 

.04,.04,.1,.03 

DEA-GENd 

25,000 

.3,.3,.3 

.3,.3 

.01, .01 

Because  of  the  economic  foundations  of  the  DEA-GEN  code,  the  data  generated 
resembles  a  class  of  problems  that  should  more  closely  simulate  realistic  economic  data 
than  what  would  be  possible  from  the  data  sampled  from  a  multinormal  distribution. 
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